A CNN-BiLSTM Model for Document-Level Sentiment Analysis

Bibliographic Details
Published in Machine Learning and Knowledge Extraction, Vol. 1, no. 3, pp. 832-847
Main Authors Rhanoui, Maryem; Mikram, Mounia; Yousfi, Siham; Barzali, Soukaina
Format Journal Article
Language English
Published 01.09.2019
ISSN 2504-4990
DOI 10.3390/make1030048

Summary: Document-level sentiment analysis is a challenging task given the large size of the text, which leads to an abundance of words and opinions, at times contradictory, in the same document. This analysis is particularly useful for press articles and blog posts about a particular product or company, and it requires a high degree of concentration, especially when the topic being discussed is sensitive. Nevertheless, most existing models and techniques are designed to process short text from social networks and collaborative platforms. In this paper, we propose a combination of Convolutional Neural Network (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) models, with Doc2vec embedding, suitable for opinion analysis in long texts. The CNN-BiLSTM model is compared with CNN, LSTM, BiLSTM and CNN-LSTM models with Word2vec/Doc2vec embeddings. The Doc2vec with CNN-BiLSTM model was applied to French newspaper articles and outperformed the other models with 90.66% accuracy.