Automatic grading for Arabic short answer questions using optimized deep learning model

Auto-grading of short answer questions is considered a challenging problem in the processing of natural language. It requires a system to comprehend the free text answers to automatically assign a grade for a student answer compared to one or more model answers. This paper suggests an optimized deep...

Full description

Saved in:

Bibliographic Details
Published in	PloS one Vol. 17; no. 8; p. e0272269
Main Authors	Abdul Salam, Mustafa, El-Fatah, Mohamed Abd, Hassan, Naglaa Fathy
Format	Journal Article
Language	English
Published	San Francisco Public Library of Science 02.08.2022 Public Library of Science (PLoS)
Subjects	Algorithms Analysis Arabic language Biology and Life Sciences Computational linguistics Computer and Information Sciences Correlation Correlation coefficient Correlation coefficients Datasets Deep learning Engineering and Technology Evaluation Hybrid systems Language processing Long short-term memory Machine learning Modelling Natural language interfaces Natural language processing Optimization Optimization techniques Physical Sciences Questions Research and Analysis Methods Root-mean-square errors Semantics Social Sciences Students Egypt
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Auto-grading of short answer questions is considered a challenging problem in the processing of natural language. It requires a system to comprehend the free text answers to automatically assign a grade for a student answer compared to one or more model answers. This paper suggests an optimized deep learning model for grading short-answer questions automatically by using various sizes of datasets collected in the Science subject for students in seventh grade in Egypt. The proposed system is a hybrid approach that optimizes a deep learning technique called LSTM (Long Short Term Memory) with a recent optimization algorithm called a Grey Wolf Optimizer (GWO). The GWO is employed to optimize the LSTM by selecting the best dropout and recurrent dropout rates of LSTM hyperparameters rather than manual choice. Using GWO makes the LSTM model more generalized and can also avoid the problem of overfitting in forecasting the students’ scores to improve the learning process and save instructors’ time and effort. The model’s performance is measured in terms of the Root Mean Squared Error (RMSE), the Pearson correlation coefficient, and R-Square. According to the simulation results, the hybrid GWO with the LSTM model ensured the best performance and outperformed the classical LSTM model and other compared models such that it had the highest Pearson correlation coefficient value, the lowest RMSE value, and the best R square value in all experiments, but higher training time than the traditional deep learning model.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0272269