Clasificacion de textos en lenguaje natural usando la Wikipedia (Classification of natural-language texts using Wikipedia)

Bibliographic Details
Published in RISTI: Revista Ibérica de Sistemas e Tecnologias de Informação no. 8; pp. 39-52
Main Authors Quinteiro-Gonzalez, Jose Maria; Martel-Jordan, Ernestina; Hernandez-Morera, Pablo; Ligero-Fleitas, Juan A; Lopez-Rodriguez, Aaron
Format Journal Article
Language Portuguese
Published Lousada AISTI (Iberian Association for Information Systems and Technologies) 01.12.2011
Associação Ibérica de Sistemas e Tecnologias de Informação
AISTI - Associação Ibérica de Sistemas e Tecnologias de Informação
Summary: Automatic text classifiers are needed in environments where the volume of data is too large for human classification to be effective. In our study, the proposed classifier uses Wikipedia to generate the corpus that defines each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms classifiers trained with Machine Learning.
ISSN: 1646-9895