Show simple item record

dc.contributor.authorZhang, Dell
dc.contributor.authorLee, Wee Sun
dc.date.accessioned2003-12-13T19:41:16Z
dc.date.available2003-12-13T19:41:16Z
dc.date.issued2004-01
dc.identifier.urihttp://hdl.handle.net/1721.1/3867
dc.description.abstractWe address the problem of integrating objects from a source taxonomy into a master taxonomy. This problem is not only pervasive on the nowadays web, but also important to the emerging semantic web. A straightforward approach to automating this process would be to train a classifier for each category in the master taxonomy, and then classify objects from the source taxonomy into these categories. In this paper we attempt to use a powerful classification method, Support Vector Machine (SVM), to attack this problem. Our key insight is that the availability of the source taxonomy data could be helpful to build better classifiers in this scenario, therefore it would be beneficial to do transductive learning rather than inductive learning, i.e., learning to optimize classification performance on a particular set of test examples. Noticing that the categorization of the master and source taxonomies often have some semantic overlap, we propose a new method, Cluster Shrinkage (CS), to further enhance the classification by exploiting such implicit knowledge. Our experiments with real-world web data show substantial improvements in the performance of taxonomy integration.en
dc.description.sponsorshipSingapore-MIT Alliance (SMA)en
dc.format.extent106014 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.relation.ispartofseriesComputer Science (CS);
dc.subjectweb taxonomy integrationen
dc.subjectclassificationen
dc.subjectsupport vector machinesen
dc.subjecttransductive learningen
dc.titleOn Web Taxonomy Integrationen
dc.typeArticleen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record