Language morphology offset: Text classification on a Croatian–English parallel corpus

We investigate how, and to what extent, morphological complexity of the language influences text classification using support vector machines (SVM). The Croatian–English parallel corpus provides the basis for direct comparison of two languages of radically different morphological complexity. We quan...

Full description

Saved in:
Bibliographic Details
Published inInformation processing & management Vol. 44; no. 1; pp. 325 - 339
Main Authors Malenica, M., Šmuc, T., Šnajder, J., Dalbelo Bašić, B.
Format Journal Article
LanguageEnglish
Published Kidlington Elsevier Ltd 2008
Elsevier
Elsevier Science Ltd
Subjects
Online AccessGet full text

Cover

Loading…