Clasificacion de textos en lenguaje natural usando la Wikipedia
Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntacti...
Saved in:
Published in | RISTI : Revista Ibérica de Sistemas e Tecnologias de Informação no. 8; pp. 39 - 52 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | Portuguese |
Published |
Lousada
AISTI (Iberian Association for Information Systems and Technologies)
01.12.2011
Associação Ibérica de Sistemas e Tecnologias de Informacao AISTI - Associação Ibérica de Sistemas e Tecnologias de Informação |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers. |
---|---|
ISSN: | 1646-9895 |