Browsing by Author "Min-Yen KAN"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
- ItemMetadata extraction and text categorization using Universal Resource Locator expansions(2003-10-01T00:00:00Z) Min-Yen KANUniform resource locators (URLs), which mark the address of a resource on the World Wide Web, are often human-readable and can indicate metadata about a resource. This paper explores the mining of URLs to yield categoric metadata about web resources via a three-phase pipeline of word segmentation, abbreviation expansion and classification. I apply this approach to the problem of subject metadata generation and quantify its performance relative to title- and document-based methods, both which require the retrieval of the source document.