Imbalanced Learning Techniques for Improving the Performance of Statistical Models in Automated Essay Scoring
The problem of class imbalance occurs in many application domains, as in the case of essays. The imbalance of the number or samples among the classes presents a problem for predictive algorithms, both for classification or for regression. In this paper, we present imbalanced learning techniques for...
Saved in:
Published in | Procedia computer science Vol. 159; pp. 764 - 773 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
2019
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The problem of class imbalance occurs in many application domains, as in the case of essays. The imbalance of the number or samples among the classes presents a problem for predictive algorithms, both for classification or for regression. In this paper, we present imbalanced learning techniques for improving the performance of statistical models in automated essay grading. We analyze the performance of different types of statistical algorithms using machine learning and classification techniques, combined with different balancing methods. The results indicated a significant improvement of accuracy and average error when the algorithms were applied in a balanced corpus, comparing with the unbalanced ones. |
---|---|
ISSN: | 1877-0509 1877-0509 |
DOI: | 10.1016/j.procs.2019.09.235 |