Analysis of the text of the FQP for automated standard control of documents

Bibliographic Details
Published in Journal of Physics: Conference Series, Vol. 2131; no. 2; pp. 22102 - 22108
Main Authors Kozyreva, A.; Nazarenko, U.; Berezhkov, A.; Nasyrov, N.
Format Journal Article
Language English
Published IOP Publishing 01.12.2021
Summary: This publication focuses on developing the possibilities of machine learning to help students prepare their final qualifying paper (FQP). Purpose of the study: to present the possibilities of machine learning for processing final qualifying paper texts and checking them for compliance with the requirements. The article shows how papers can be distributed by topic, which can help students find materials on their topic, and presents algorithms for extracting and analyzing Russian-language text. The research follows the CRISP-DM methodology and describes all the necessary research steps in detail. The paper shows the process of extracting text from PDF and DOCX files, describes the text preprocessing methods needed for further analysis, and demonstrates the capabilities of machine learning algorithms using LDA analysis as an example.
ISSN: 1742-6588, 1742-6596
DOI: 10.1088/1742-6596/2131/2/022102
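
The summary names two steps, text preprocessing and LDA analysis, without showing them. The sketch below illustrates one possible form of those steps. It is not the authors' implementation. It assumes that "LDA" means latent Dirichlet allocation (a topic model, which fits the topic-distribution goal) and fits it with a small collapsed Gibbs sampler written with the Python standard library only. The function names, the stop-word list, the hyperparameters, and the sample texts are all hypothetical. Extraction from PDF and DOCX files is left out because it needs third-party libraries.

```python
import random
import re
from collections import Counter

# Hypothetical stop-word list; a real pipeline for Russian text needs a fuller one.
STOP_WORDS = {"the", "a", "of", "and", "in", "for", "is", "to"}


def preprocess(text):
    """Lowercase, keep alphabetic tokens (Latin or Cyrillic), drop stop words."""
    tokens = re.findall(r"[a-zа-яё]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def lda_gibbs(docs, n_topics=2, alpha=0.1, beta=0.01, n_iter=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling.

    Returns (theta, topic_word): per-document topic proportions and
    per-topic word counts. All hyperparameters are illustrative defaults.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_topic = [[0] * n_topics for _ in docs]          # tokens of doc d in topic k
    topic_word = [Counter() for _ in range(n_topics)]   # count of word w in topic k
    topic_total = [0] * n_topics                        # tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Resample its topic from the conditional distribution.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    # Smoothed topic proportions per document.
    theta = [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]
    return theta, topic_word


# Made-up sample texts standing in for extracted paper contents.
texts = [
    "Neural networks and deep learning for image recognition",
    "Training deep neural networks for image classification",
    "Database indexing and query optimization in SQL",
    "Query planning and indexing for SQL database engines",
]
docs = [preprocess(t) for t in texts]
theta, topic_word = lda_gibbs(docs)
for text, dist in zip(texts, theta):
    print([round(p, 2) for p in dist], text)
```

Each printed row gives the estimated topic proportions for one text. Texts whose proportions concentrate on the same topic can be grouped together, which is one way to support the search for materials by topic that the summary describes.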