Kernel based part of speech tagger for Kannada

The proposed paper presents the development of a part-of-speech tagger for Kannada language that can be used for analyzing and annotating Kannada texts. POS tagging is considered as one of the basic tool and component necessary for many Natural Language Processing (NLP) applications like speech reco...

Full description

Saved in:

Bibliographic Details
Published in	2010 International Conference on Machine Learning and Cybernetics Vol. 4; pp. 2139 - 2144
Main Authors	Antony, P J, Soman, K P
Format	Conference Proceeding
Language	English
Published	IEEE 01.07.2010
Subjects	Artificial neural networks Classification Classification algorithms Context Kannada Machine learning NLP POS Tagger Support Vector Machine Support vector machines Tagging Training
Online Access	Get full text
ISBN	9781424465262 1424465265
ISSN	2160-133X
DOI	10.1109/ICMLC.2010.5580488

Cover

Loading…

More Information
Summary:	The proposed paper presents the development of a part-of-speech tagger for Kannada language that can be used for analyzing and annotating Kannada texts. POS tagging is considered as one of the basic tool and component necessary for many Natural Language Processing (NLP) applications like speech recognition, natural language parsing, information retrieval and information extraction of a given language. In order to alleviate problems for Kannada language, we proposed a new machine learning POS tagger approach. Identifying the ambiguities in Kannada lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. We have developed our own tagset which consist of 30 tags and built a part-of-speech Tagger for Kannada Language using Support Vector Machine (SVM). A corpus of texts, extracted from Kannada news papers and books, is manually morphologically analyzed and tagged using our developed tagset. The performance of the system is evaluated and we found that the result obtained was more efficient and accurate compared with earlier methods for Kannada POS tagging.
ISBN:	9781424465262 1424465265
ISSN:	2160-133X
DOI:	10.1109/ICMLC.2010.5580488