DSpace Repository

Measuring XML Structured-ness with Entropy

dc.contributor.author TANG, Ruiming en_US
dc.contributor.author WU, Huayu en_US
dc.contributor.author BRESSAN, Stephane en_US
dc.date.accessioned 2011-06-20T09:01:08Z en_US
dc.date.accessioned 2017-01-23T07:00:15Z
dc.date.available 2011-06-20T09:01:08Z en_US
dc.date.available 2017-01-23T07:00:15Z
dc.date.issued 2011-06-03 en_US
dc.identifier.uri http://hdl.handle.net/1900.100/3445 en_US
dc.description.abstract XML is semi-structured. It can be used to annotate unstructured data, to represent structured data, and almost anything in between. Yet it is unclear how to formally characterize, let alone quantify, the structured-ness of XML. In this paper, we propose and evaluate entropy-based metrics for XML structured-ness. The metrics measure the structural uniformity of paths and subtrees, respectively. We empirically study the correlation of these metrics on real and synthetic data sets. en_US
dc.format.extent 490703 bytes en_US
dc.format.mimetype application/pdf en_US
dc.language.iso en en_US
dc.relation.ispartofseries TRA6/11 en_US
dc.title Measuring XML Structured-ness with Entropy en_US
dc.type Technical Report en_US
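
The abstract does not give the metric definitions, so the following is only an illustrative sketch of what an entropy-based measure over paths might look like, not the report's actual metric. It assumes, hypothetically, that each element is labeled by its root-to-element tag path and that the measure is the Shannon entropy of the resulting path distribution. The function name `path_entropy` is invented for this example.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter


def path_entropy(xml_text: str) -> float:
    """Shannon entropy (bits) of the distribution of root-to-element tag paths.

    Assumption: this is one plausible reading of "structural uniformity of
    paths"; the report itself may define its metric differently.
    """
    root = ET.fromstring(xml_text)
    counts = Counter()

    def walk(node, prefix):
        # Label each element by the sequence of tags from the root down to it.
        path = prefix + "/" + node.tag
        counts[path] += 1
        for child in node:
            walk(child, path)

    walk(root, "")
    total = sum(counts.values())
    # H = sum over paths of p * log2(1 / p), with p = count / total.
    return sum((c / total) * math.log2(total / c) for c in counts.values())


uniform = "<r><x/><x/><x/></r>"   # children all share one path
mixed = "<r><x/><y/><z/></r>"     # every element has a distinct path
print(path_entropy(uniform), path_entropy(mixed))
```

Under this assumed definition, a document whose elements concentrate on few distinct paths yields a lower value than one whose elements are spread evenly over many distinct paths.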

