Automatic Labeling of Topics

An algorithm for the automatic labeling of topics accordingly to a hierarchy is presented. Its main ingredients are a set of similarity measures and a set of topic labeling rules. The labeling rules are specifically designed to find the most agreed labels between the given topic and the hierarchy. T...

Full description

Saved in:
Bibliographic Details
Published in2009 Ninth International Conference on Intelligent Systems Design and Applications pp. 1227 - 1232
Main Authors Magatti, D., Calegari, S., Ciucci, D., Stella, F.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2009
Subjects
Online AccessGet full text
ISBN1424447356
9781424447350
ISSN2164-7143
DOI10.1109/ISDA.2009.165

Cover

Loading…
More Information
Summary:An algorithm for the automatic labeling of topics accordingly to a hierarchy is presented. Its main ingredients are a set of similarity measures and a set of topic labeling rules. The labeling rules are specifically designed to find the most agreed labels between the given topic and the hierarchy. The hierarchy is obtained from the Google Directory service, extracted via an ad-hoc developed software procedure and expanded through the use of the OpenOffice English Thesaurus. The performance of the proposed algorithm is investigated by using a document corpus consisting of 33,801 documents and a dictionary consisting of 111,795 words. The results are encouraging, while particularly interesting and significant labeling cases emerged.
ISBN:1424447356
9781424447350
ISSN:2164-7143
DOI:10.1109/ISDA.2009.165