Effective XML Keyword Search with Nearest Common Object Node Semantics

dc.contributor.authorLE, Thuy Ngocen_US
dc.contributor.authorLING, Tok Wangen_US
dc.contributor.authorLIN, Chunbinen_US
dc.contributor.authorLU, Jiahengen_US
dc.date.accessioned2012-09-20T06:19:17Zen_US
dc.date.accessioned2017-01-23T07:00:06Z
dc.date.available2012-09-20T06:19:17Zen_US
dc.date.available2017-01-23T07:00:06Z
dc.date.issued2012-09-20en_US
dc.description.abstractLowest Common Ancestor (LCA) semantics and its extensions such as SLCA, MLCA, VLCA and ELCA. However, these approaches commonly do not return a complete answer set for a query because they can only find the common ancestors of a set of keywords but cannot find their common information appearing at their descendants in an XML document. In this paper, we introduce a new semantics, called Nearest Common Objects Node (NCON), which guarantees that both common ancestors and common descendants are included in the answer set for a query and therefore enables us to answer a query more completely. We also propose an NCON-based approach for XML keyword search, which exploits not only the index of the original XML document, but also the index of its reversed XML document, and devise optimization techniques to facilitate the process of finding NCONs. We have developed XComplete, a system for our NCON-based approach, which essentially uses the NCON semantics and post-processing techniques, altogether enable XComplete to return an answer set with completeness, meaningfulness, no irrelevance, no duplicate and comprehension to users. The results of our extensive experiments show that our proposed approach outperforms the existing LCA-based approaches in terms of both effectiveness and efficiency.en_US
dc.format.extent954730 bytesen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://dl.comp.nus.edu.sg/xmlui/handle/1900.100/3839en_US
dc.language.isoenen_US
dc.relation.ispartofseries;TRA9/12en_US
dc.titleEffective XML Keyword Search with Nearest Common Object Node Semanticsen_US
dc.typeTechnical Reporten_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TRA9-12.pdf
Size:
932.35 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.53 KB
Format:
Plain Text
Description: