Optimization of Database Operations in the Application for Text Corpus Analysis

The user's comfortable work with large volumes of text is ensured by optimizing the web application. This optimization is primarily focused on optimizing work with the database. The SQL query language is an example of a declarative language that allows for effective query optimization.The artic...

Full description

Saved in:
Bibliographic Details
Published in2024 20th International Asian School-Seminar on Optimization Problems of Complex Systems (OPCS) pp. 1 - 6
Main Authors Barakhnin, Vladimir B., Karpov, Matvey V., Machikina, Elena P., Musasbayev, Rustam R.
Format Conference Proceeding
LanguageEnglish
Published IEEE 19.07.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The user's comfortable work with large volumes of text is ensured by optimizing the web application. This optimization is primarily focused on optimizing work with the database. The SQL query language is an example of a declarative language that allows for effective query optimization.The article formalizes the optimization problem as improving the quality of application usage by increasing the user's productivity in the system. This is achieved by reducing the reaction time of the system in response to user actions.There are three main strategies for query optimization: modifying the source code of the program that interacts with the database, modifying the structure of queries, and modifying the structure of the database itself. The technologies used to create the optimized software system include: the application was written in Python using the Flask web framework. Work with the database was done using the SQLAlchemy library and PostgreSQL was chosen as the DBMS (Database Management System). To test the functionality of the optimized queries, the pytest library was used.Specific optimization examples were presented for the following situations: long execution time for any stage of a data request can be solved by changing the query structure by adding an index. Long execution time for any stage of data change can be solved by deleting the index from the query. A large number of queries within a cycle can be replaced with an equivalent query outside the loop to improve performance.The results of the computational experiments demonstrate the high efficiency of the optimization technique.
DOI:10.1109/OPCS63516.2024.10720387