Measuring XML Structured-ness with Entropy

dc.contributor.authorTANG, Ruimingen_US
dc.contributor.authorWU, Huayuen_US
dc.contributor.authorBRESSAN, Stephaneen_US
dc.date.accessioned2011-06-20T09:01:08Zen_US
dc.date.accessioned2017-01-23T07:00:15Z
dc.date.available2011-06-20T09:01:08Zen_US
dc.date.available2017-01-23T07:00:15Z
dc.date.issued2011-06-03en_US
dc.description.abstractXML is semi-structured. It can be used to annotate unstructured data, to represent structured data and almost anything in-between. Yet, it is unclear how to formally characterize, yet to quantify, structuredness of XML. In this paper we propose and evaluate entropy-based metrics for XML structured-ness. The metrics measure the structural uniformity of path and subtrees, respectively. We empirically study the correlation of these metrics with real and synthetic data sets.en_US
dc.format.extent490703 bytesen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://dl.comp.nus.edu.sg/xmlui/handle/1900.100/3445en_US
dc.language.isoenen_US
dc.relation.ispartofseriesTRA6/11en_US
dc.titleMeasuring XML Structured-ness with Entropyen_US
dc.typeTechnical Reporten_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TRA6-11.pdf
Size:
479.2 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.53 KB
Format:
Plain Text
Description: