Language morphology offset: Text classification on a Croatian–English parallel corpus
We investigate how, and to what extent, morphological complexity of the language influences text classification using support vector machines (SVM). The Croatian–English parallel corpus provides the basis for direct comparison of two languages of radically different morphological complexity. We quan...
Published in | Information processing & management Vol. 44; no. 1; pp. 325 - 339 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published | Kidlington: Elsevier Ltd; Elsevier Science Ltd, 2008 |