Automatic identification of assamese and bodo multiword expressions

Multiword Expressions (MWEs) are sequence of words separated by space or delimiter which determines a unique meaning instead of words' individual meanings. Our work concentrates on automatic identification of MWEs for two less computationally aware languages Assamese and Bodo spoken in the Nort...

Full description

Saved in:
Bibliographic Details
Published in2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI) pp. 26 - 30
Main Authors Barman, Anup Kumar, Sarmah, Jumi, Sarma, Shikhar Kr
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.08.2013
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Multiword Expressions (MWEs) are sequence of words separated by space or delimiter which determines a unique meaning instead of words' individual meanings. Our work concentrates on automatic identification of MWEs for two less computationally aware languages Assamese and Bodo spoken in the North Eastern part of India. Statistical measure and Language specific knowledge helps us to extract MWEs from raw corpus. Natural Language Processing tasks in Assamese and Bodo languages have started in recent years, and this is the first organised approach to exploit MWEs in both these languages. Linguistics aspects for analysing the results have been considered, and we have found the results quite satisfactory.
ISBN:9781479924325
1479924326
DOI:10.1109/ICACCI.2013.6637141