A semantic approach to XML schema integration
Date
2004-09-01T00:00:00Z
Abstract
In this paper, we adopt a semantically rich model, the Object-Relationship-Attribute model for Semi-Structured data (ORASS), to represent XML schemas. A challenge in XML schema integration comes from the hierarchical structure of XML. For example, two sets of XML elements from two sources may constitute the same relationship type but in different hierarchies. In the integrated schema, we then need to decide on a "good" hierarchy for these elements. In general, we require that an integrated schema preserve the semantics of the source schemas, have minimum redundancy, and lead to low-cost data transformation. Guided by these criteria, we developed algorithms to merge equivalent elements and equivalent relationship types among elements from the source schemas, and proposed a top-down approach to integrating ORASS schemas that meets the challenges caused by the hierarchical structure of XML.
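
To make the hierarchy problem concrete, the following is a minimal illustrative sketch, not the paper's algorithm. It assumes two hypothetical sources that record the same supplier-part relationship, one nesting parts under suppliers and the other nesting suppliers under parts. The element names, the `name` attribute, and the helper `relationship_instances` are all assumptions made for illustration. The sketch flattens each hierarchy into unordered pairs and checks that both sources carry the same relationship instances.

```python
import xml.etree.ElementTree as ET

# Hypothetical source A: parts nested under suppliers.
SOURCE_A = """
<suppliers>
  <supplier name="S1">
    <part name="P1"/>
    <part name="P2"/>
  </supplier>
  <supplier name="S2">
    <part name="P1"/>
  </supplier>
</suppliers>
"""

# Hypothetical source B: the same facts, with suppliers nested under parts.
SOURCE_B = """
<parts>
  <part name="P1">
    <supplier name="S1"/>
    <supplier name="S2"/>
  </part>
  <part name="P2">
    <supplier name="S1"/>
  </part>
</parts>
"""


def relationship_instances(xml_text, parent_tag, child_tag):
    """Collect parent/child pairs as order-independent sets of (tag, name)."""
    root = ET.fromstring(xml_text)
    pairs = set()
    for parent in root.iter(parent_tag):
        for child in parent.findall(child_tag):
            # A frozenset drops the nesting direction, so only the
            # participating elements of each instance are compared.
            pairs.add(frozenset({
                (parent_tag, parent.get("name")),
                (child_tag, child.get("name")),
            }))
    return pairs


a = relationship_instances(SOURCE_A, "supplier", "part")
b = relationship_instances(SOURCE_B, "part", "supplier")

# True when both hierarchies encode the same set of relationship instances.
same_relationship = a == b
print(same_relationship)
```

Comparing instances in this way only shows that the two hierarchies carry the same information. Choosing which nesting the integrated schema should use is the separate question that the criteria above (semantics preservation, minimum redundancy, low transformation cost) are meant to guide.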