Get a Sample for a Discount Sampling-Based XML Data Pricing

dc.contributor.authorTANG, Ruimingen_US
dc.contributor.authorAMARILLI, Antoineen_US
dc.contributor.authorSENELLART, Pierreen_US
dc.contributor.authorBRESSAN, Stéphaneen_US
dc.date.accessioned2014-03-12T08:54:12Zen_US
dc.date.accessioned2017-01-23T07:00:08Z
dc.date.available2014-03-12T08:54:12Zen_US
dc.date.available2017-01-23T07:00:08Z
dc.date.issued2014-03-12en_US
dc.description.abstractWhile price and data quality should define the major tradeoff for consumers in data markets, prices are usually prescribed by vendors and data quality is not negotiable. In this paper we study a model where data quality can be traded for a discount. We focus on the case of XML documents and consider completeness as the quality dimension. In our setting, the data provider offers an XML document, and sets both the price of the document and a weight to each node of the document, depending on its potential worth. The data consumer proposes a price. If the proposed price is lower than that of the entire document, then the data consumer receives a sample, i.e., a random rooted subtree of the document whose selection depends on the discounted price and the weight of nodes. By requesting several samples, the data consumer can iteratively explore the data in the document. We show that the uniform random sampling of a rooted subtree with prescribed weight is unfortunately intractable. However, we are able to identify several practical cases that are tractable. The first case is uniform random sampling of a rooted subtree with prescribed size; the second case restricts to binary weights. For both these practical cases we present polynomial-time algorithms and explain how they can be integrated into an iterative exploratory sampling approach.en_US
dc.format.extent501788 bytesen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://dl.comp.nus.edu.sg/xmlui/handle/1900.100/4362en_US
dc.language.isoenen_US
dc.relation.ispartofseries;TRA3/2014en_US
dc.titleGet a Sample for a Discount Sampling-Based XML Data Pricingen_US
dc.typeTechnical Reporten_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TRA3-14.pdf
Size:
490.03 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.53 KB
Format:
Plain Text
Description: