Automatic identification of assamese and bodo multiword expressions
Multiword Expressions (MWEs) are sequence of words separated by space or delimiter which determines a unique meaning instead of words' individual meanings. Our work concentrates on automatic identification of MWEs for two less computationally aware languages Assamese and Bodo spoken in the Nort...
Saved in:
Published in | 2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI) pp. 26 - 30 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.08.2013
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Multiword Expressions (MWEs) are sequence of words separated by space or delimiter which determines a unique meaning instead of words' individual meanings. Our work concentrates on automatic identification of MWEs for two less computationally aware languages Assamese and Bodo spoken in the North Eastern part of India. Statistical measure and Language specific knowledge helps us to extract MWEs from raw corpus. Natural Language Processing tasks in Assamese and Bodo languages have started in recent years, and this is the first organised approach to exploit MWEs in both these languages. Linguistics aspects for analysing the results have been considered, and we have found the results quite satisfactory. |
---|---|
ISBN: | 9781479924325 1479924326 |
DOI: | 10.1109/ICACCI.2013.6637141 |