A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification

Multi-label text classification (MLTC) is a popular method for organizing electronic documents, which is crucial for accessing and processing data. As the number of classes increases, learning multi-label data will be challenging. The number of possible states for various labels increases exponentia...

Full description

Saved in:

Bibliographic Details
Published in	Neural processing letters Vol. 56; no. 2; p. 42
Main Authors	Maragheh, Hamed Khataei, Gharehchopogh, Farhad Soleimanian, Majidzadeh, Kambiz, Sangar, Amin Babazadeh
Format	Journal Article
Language	English
Published	New York Springer US 16.02.2024 Springer Nature B.V
Subjects	Accuracy Algorithms Artificial Intelligence Artificial neural networks Classification Complex Systems Computational Intelligence Computer Science Data processing Datasets Deep learning Electronic documents Hybridization Labels Machine learning Model accuracy Natural language Neural networks Search algorithms Text categorization Multi-label text classification Long short-term memory Competitive search algorithm Convolutional neural network
Online Access	Get full text
ISSN	1573-773X 1370-4621 1573-773X
DOI	10.1007/s11063-024-11500-8

Cover

More Information
Summary:	Multi-label text classification (MLTC) is a popular method for organizing electronic documents, which is crucial for accessing and processing data. As the number of classes increases, learning multi-label data will be challenging. The number of possible states for various labels increases exponentially, and learning algorithms in single-label data cannot be used to solve these problems. In the meantime, using single-label data algorithms could be very time-consuming. In MLTC, complexity costs should be reduced. Deep-learning neural networks that can learn intricate patterns are used in many real-world problems because of their high power and accuracy. This paper proposed a hybridization of the long short-term memory (LSTM) neural network and the convolutional neural network (CNN) method for MLTC. The proposed model uses LSTM to enhance CNN to improve the proposed model’s accuracy. Also, the competitive search algorithm (CSA) is used to improve the LSTM hyperparameters. The LSTM hyperparameters play an important role in increasing the detection accuracy. The CSA algorithm finds the best values for the hyperparameters by searching the problem space. It was tested on four different datasets of multi-label texts: Reuters-21578, RCV1-v2, EUR-Lex, and Bookmarks. The result showed that the proposed model performed better than CNN and LSTM-CSA in terms of accuracy percentage and that it has improved by an average of more than 10%. Also, the results show that the LSTM-CSA model has higher detection accuracy compared to LSTM—Gradient-based optimizer (GBO) and LSTM—whale optimization algorithm (WOA).
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1573-773X 1370-4621 1573-773X
DOI:	10.1007/s11063-024-11500-8