Clasificación de textos en lenguaje natural usando la Wikipedia [Natural-language text classification using Wikipedia]

Bibliographic Details
Published in	RISTI : Revista Ibérica de Sistemas e Tecnologias de Informação no. 8; pp. 39 - 52
Main Authors	Quinteiro-Gonzalez, Jose Maria; Martel-Jordan, Ernestina; Hernandez-Morera, Pablo; Ligero-Fleitas, Juan A; Lopez-Rodriguez, Aaron
Format	Journal Article
Language	Portuguese
Published	Lousada: AISTI - Associação Ibérica de Sistemas e Tecnologias de Informação (Iberian Association for Information Systems and Technologies), 01.12.2011
Subjects	Computer Science, Information Systems; Machine Learning; Text Categorization; tf-idf; Natural Language Processing; Wikipedia
Summary:	Automatic text classifiers are needed in environments where the volume of data is too large for human classification to be effective. In our study, the proposed classifier uses Wikipedia to generate the corpus that defines each category. The text to be classified is then analyzed syntactically using natural language processing software. The proposed classifier is highly accurate and outperforms classifiers trained with machine learning.
ISSN:	1646-9895