Improving the Accuracy and Effectiveness of Text Classification Based on the Integration of the Bert Model and a Recurrent Neural Network (RNN_Bert_Based)

Bibliographic Details
Published in: Applied Sciences, Vol. 14, No. 18, p. 8388
Main Authors: Eang, Chanthol; Lee, Seungjae
Format: Journal Article
Language: English
Published: Basel: MDPI AG, 01.09.2024

Summary: This paper proposes a new, robust model for text classification that improves accuracy on the Stanford Sentiment Treebank v2 (SST-2) dataset. We developed a Recurrent Neural Network BERT-based (RNN_Bert_based) model designed to improve classification accuracy on SST-2. This dataset consists of movie review sentences, each labeled with either positive or negative sentiment, making it a binary classification task. Recurrent Neural Networks (RNNs) are effective for text classification because they capture the sequential nature of language, which is crucial for understanding context and meaning. BERT excels at text classification by providing bidirectional context, generating contextual embeddings, and leveraging pre-training on large corpora, which allows it to capture nuanced meanings and relationships within the text. Combining BERT with RNNs can therefore be highly effective: BERT's bidirectional context and rich embeddings provide a deep understanding of the text, while RNNs capture sequential patterns and long-range dependencies, so together they leverage the strengths of both architectures and improve performance on complex classification tasks. We also developed an integration of the BERT model with a K-Nearest Neighbor-based method (KNN_Bert_based) as a comparative scheme for our proposed work. Experimental results show that our proposed model outperforms traditional text classification models as well as existing models in terms of accuracy.
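
The abstract does not give implementation details, so the following is only a minimal, hypothetical sketch of the kind of architecture it describes: a bidirectional LSTM stacked on BERT's contextual token embeddings for SST-2 binary classification, written with PyTorch and the Hugging Face transformers library. The class name, layer sizes, and all other hyperparameters are assumptions for illustration, not values taken from the paper.

# Minimal sketch (assumptions, not the paper's exact configuration):
# BERT encodes each sentence into contextual token embeddings, an LSTM reads
# them sequentially, and a linear layer maps the final hidden state to
# positive/negative logits.
import torch
import torch.nn as nn
from transformers import BertModel, BertTokenizer

class RNNBertBased(nn.Module):
    def __init__(self, bert_name="bert-base-uncased", rnn_hidden=256, num_labels=2):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)
        # Bidirectional LSTM over BERT's token-level embeddings
        self.rnn = nn.LSTM(
            input_size=self.bert.config.hidden_size,
            hidden_size=rnn_hidden,
            batch_first=True,
            bidirectional=True,
        )
        self.classifier = nn.Linear(2 * rnn_hidden, num_labels)

    def forward(self, input_ids, attention_mask):
        # Contextual embeddings for every token: (batch, seq_len, hidden_size)
        sequence_output = self.bert(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        rnn_out, _ = self.rnn(sequence_output)        # (batch, seq_len, 2*rnn_hidden)
        # Use the last time step's hidden state as the sentence representation
        logits = self.classifier(rnn_out[:, -1, :])   # (batch, num_labels)
        return logits

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = RNNBertBased()
batch = tokenizer(
    ["a touching and funny film", "a dull, lifeless mess"],
    padding=True, truncation=True, return_tensors="pt",
)
logits = model(batch["input_ids"], batch["attention_mask"])
probs = torch.softmax(logits, dim=-1)  # per-class probabilities (assuming 0=negative, 1=positive)

In practice such a model would be fine-tuned end to end on the SST-2 training split with a cross-entropy loss; the KNN_Bert_based comparison scheme mentioned in the abstract would instead classify a sentence by nearest neighbors in the BERT embedding space.
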
ISSN: 2076-3417
DOI: 10.3390/app14188388