ASA: A framework for Arabic sentiment analysis

Sentiment analysis (SA), also known as opinion mining, is a growing important research area. Generally, it helps to automatically determine if a text expresses a positive, negative or neutral sentiment. It enables to mine the huge increasing resources of shared opinions such as social networks, revi...

Full description

Saved in:

Bibliographic Details
Published in	Journal of information science Vol. 46; no. 4; pp. 544 - 559
Main Authors	Oussous, Ahmed, Benjelloun, Fatima-Zahra, Lahcen, Ayoub Ait, Belfkih, Samir
Format	Journal Article
Language	English
Published	London, England SAGE Publications 01.08.2020 Bowker-Saur Ltd
Subjects	Artificial neural networks Data mining Deep learning Machine learning Maximum entropy Morphology Natural language processing Performance enhancement Preprocessing Sentiment analysis Social networks Support vector machines deep learning machine-learning Arabic sentiment analysis convolutional neural network long short-term memory
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Sentiment analysis (SA), also known as opinion mining, is a growing important research area. Generally, it helps to automatically determine if a text expresses a positive, negative or neutral sentiment. It enables to mine the huge increasing resources of shared opinions such as social networks, review sites and blogs. In fact, SA is used by many fields and for various languages such as English and Arabic. However, since Arabic is a highly inflectional and derivational language, it raises many challenges. In fact, SA of Arabic text should handle such complex morphology. To better handle these challenges, we decided to provide the research community and Arabic users with a new efficient framework for Arabic Sentiment Analysis (ASA). Our primary goal is to improve the performance of ASA by exploiting deep learning while varying the preprocessing techniques. For that, we implement and evaluate two deep learning models namely convolutional neural network (CNN) and long short-term memory (LSTM) models. The framework offers various preprocessing techniques for ASA (including stemming, normalisation, tokenization and stop words). As a result of this work, we first provide a new rich and publicly available Arabic corpus called Moroccan Sentiment Analysis Corpus (MSAC). Second, the proposed framework demonstrates improvement in ASA. In fact, the experimental results prove that deep learning models have a better performance for ASA than classical approaches (support vector machines, naive Bayes classifiers and maximum entropy). They also show the key role of morphological features in Arabic Natural Language Processing (NLP).
ISSN:	0165-5515 1741-6485
DOI:	10.1177/0165551519849516