Analysis of the text of the FQP for automated standard control of documents
Abstract This publication focuses on underdevelopment the possibilities of machine learn-ing to help students prepare their final qualifying paper. Purpose of the study: present the possibilities of machine learning for processing final qualifying paper texts and checking them for compliance with th...
Saved in:
Published in | Journal of physics. Conference series Vol. 2131; no. 2; pp. 22102 - 22108 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
IOP Publishing
01.12.2021
|
Online Access | Get full text |
Cover
Loading…
Summary: | Abstract
This publication focuses on underdevelopment the possibilities of machine learn-ing to help students prepare their final qualifying paper. Purpose of the study: present the possibilities of machine learning for processing final qualifying paper texts and checking them for compliance with the requirements. The article shows the possibilities of distributing work by topic, which can help students in finding materials on their topic and algorithms for extracting and analyzing text in Rus-sian for further analysis. The research is carried out on the basis of the CRISP DM methodology and describes in detail all the necessary research steps. The pa-per shows the process of extracting text from pdf and docx files; the necessary methods of text preprocessing for further analysis; and demonstrates the capabili-ties of machine learning algorithms using the example of LDA analysis. |
---|---|
ISSN: | 1742-6588 1742-6596 |
DOI: | 10.1088/1742-6596/2131/2/022102 |