Improved whale optimization algorithm for feature selection in Arabic sentiment analysis

To help individuals or companies make a systematic and more accurate decisions, sentiment analysis (SA) is used to evaluate the polarity of reviews. In SA, feature selection phase is an important phase for machine learning classifiers specifically when the datasets used in training is huge. Whale Op...

Full description

Saved in:
Bibliographic Details
Published inApplied intelligence (Dordrecht, Netherlands) Vol. 49; no. 5; pp. 1688 - 1707
Main Authors Tubishat, Mohammad, Abushariah, Mohammad A. M., Idris, Norisma, Aljarah, Ibrahim
Format Journal Article
LanguageEnglish
Published New York Springer US 01.05.2019
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To help individuals or companies make a systematic and more accurate decisions, sentiment analysis (SA) is used to evaluate the polarity of reviews. In SA, feature selection phase is an important phase for machine learning classifiers specifically when the datasets used in training is huge. Whale Optimization Algorithm (WOA) is one of the recent metaheuristic optimization algorithm that mimics the whale hunting mechanism. However, WOA suffers from the same problem faced by many other optimization algorithms and tend to fall in local optima. To overcome these problems, two improvements for WOA algorithm are proposed in this paper. The first improvement includes using Elite Opposition-Based Learning (EOBL) at initialization phase of WOA. The second improvement involves the incorporation of evolutionary operators from Differential Evolution algorithm at the end of each WOA iteration including mutation, crossover, and selection operators. In addition, we also used Information Gain (IG) as a filter features selection technique with WOA using Support Vector Machine (SVM) classifier to reduce the search space explored by WOA. To verify our proposed approach, four Arabic benchmark datasets for sentiment analysis are used since there are only a few studies in sentiment analysis conducted for Arabic language as compared to English. The proposed algorithm is compared with six well-known optimization algorithms and two deep learning algorithms. The comprehensive experiments results show that the proposed algorithm outperforms all other algorithms in terms of sentiment analysis classification accuracy through finding the best solutions, while its also minimizes the number of selected features.
ISSN:0924-669X
1573-7497
DOI:10.1007/s10489-018-1334-8