Measuring XML Structured-ness with Entropy
dc.contributor.author | TANG, Ruiming | en_US |
dc.contributor.author | WU, Huayu | en_US |
dc.contributor.author | BRESSAN, Stephane | en_US |
dc.date.accessioned | 2011-06-20T09:01:08Z | en_US |
dc.date.accessioned | 2017-01-23T07:00:15Z | |
dc.date.available | 2011-06-20T09:01:08Z | en_US |
dc.date.available | 2017-01-23T07:00:15Z | |
dc.date.issued | 2011-06-03 | en_US |
dc.description.abstract | XML is semi-structured. It can be used to annotate unstructured data, to represent structured data and almost anything in-between. Yet, it is unclear how to formally characterize, yet to quantify, structuredness of XML. In this paper we propose and evaluate entropy-based metrics for XML structured-ness. The metrics measure the structural uniformity of path and subtrees, respectively. We empirically study the correlation of these metrics with real and synthetic data sets. | en_US |
dc.format.extent | 490703 bytes | en_US |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.uri | https://dl.comp.nus.edu.sg/xmlui/handle/1900.100/3445 | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartofseries | TRA6/11 | en_US |
dc.title | Measuring XML Structured-ness with Entropy | en_US |
dc.type | Technical Report | en_US |