City University of Hong Kong

CityU Institutional Repository >
3_CityU Electronic Theses and Dissertations >
ETD - Dept. of Computer Science  >
CS - Doctor of Philosophy  >

Please use this identifier to cite or link to this item:

Title: A structured export markup language interpreter for exporting data between relational and XML databases : an ETL approach
Other Titles: Yi jie gou hua biao ji yu yan ji qi jie shi qi chu li guan xi shu ju ku he biao ji yu yan shu ju ku zhi jian de shu ju jiao liu
Authors: Shiu, Hoi Cheung ( 邵海祥)
Department: Department of Computer Science
Degree: Doctor of Philosophy
Issue Date: 2011
Publisher: City University of Hong Kong
Subjects: Database management
Relational databases.
XML (Document markup language)
Interpreters (Computer programs)
Notes: CityU Call Number: QA76.9.D3 S539 2011
248 leaves : ill. 30 cm.
Thesis (Ph.D.)--City University of Hong Kong, 2011.
Includes bibliographical references (leaves 171-181)
Type: thesis
Abstract: Most enterprises use relational databases to handle real time business operations. They also need to generate various XML documents for data exchanges internally among various departments or externally with business partners for e-commerce on the Internet. It is necessary for enterprises to handle XML documents, which are stored in XML database, but most production data must be extracted and stored in a relational database. As such, there are relational databases for handling relational data and XML databases for manipulating XML documents. These two database systems operate with different data definition languages and data manipulation languages. This dichotomy of practice presents a burden to software developers who must master the database languages for these two database systems. In addition to the data manipulation issues, exporting data from a relational database into an XML document and vice versa are common practice, which is considered a data export process, and also an export-transform-loading (ETL) problem between these two data models. The complication is that the Relational and XML models are fundamentally different. The Relational database employs a network data structure and the XML database employs a hierarchical tree data structure. Furthermore, they use different mechanisms for entity identities. There are existing solutions for handling different database models and also for solving ETL problems between them, but they are proprietary and not user-friendly. Among the four approaches for data export of customized program, interpretive transformer, translator generator, and logical level translation, this dissertation proposes a new markup language, namely Structured Export Markup Language (SEML) and the associated interpretive approach for converting relational data into XML documents, and vice versa. This approach is XML-based, user-friendly, and comprehensive for data export and can be used as a generic tool for ETL purposes. In other words, the SEML interpreter can handle ETL operations in a generic and user friendly way with a view of an input schema or a target schema. The data dependency constraints of the source database can be preserved after data export to the target database, that is, between Relational and XML data.
Online Catalog Link:
Appears in Collections:CS - Doctor of Philosophy

Files in This Item:

File Description SizeFormat
abstract.html132 BHTMLView/Open
fulltext.html132 BHTMLView/Open

Items in CityU IR are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0!
DSpace Software © 2013 CityU Library - Send feedback to Library Systems
Privacy Policy · Copyright · Disclaimer