Classification of sentiment reviews using n-gram machine learning approach

Bibliographic Details
Published in: Expert Systems with Applications, Vol. 57, pp. 117-126
Main Authors: Tripathy, Abinash; Agrawal, Ankit; Rath, Santanu Kumar
Format: Journal Article
Language: English
Published: Elsevier Ltd, 15.09.2016

Summary:
• A large number of sentiment reviews, blogs and comments are present online.
• These reviews must be classified to obtain meaningful information.
• Four different supervised machine learning algorithms are used for classification.
• Unigram, bigram and trigram models and their combinations are used for classification.
• The classification is done on the IMDb movie review dataset.

With ever-increasing social networking and online marketing sites, the reviews and blogs obtained from them act as an important source for further analysis and improved decision making. These reviews are mostly unstructured by nature and thus need processing, such as classification or clustering, to provide meaningful information for future use. The reviews and blogs may be classified into different polarity groups, such as positive, negative, and neutral, in order to extract information from the input dataset. Supervised machine learning methods help to classify these reviews. In this paper, four different machine learning algorithms, namely Naive Bayes (NB), Maximum Entropy (ME), Stochastic Gradient Descent (SGD), and Support Vector Machine (SVM), have been considered for the classification of human sentiments. The different methods are critically examined in order to assess their performance on the basis of parameters such as precision, recall, F-measure, and accuracy.
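The summary names n-gram features (unigram, bigram, trigram and combinations), Naive Bayes as one of the four classifiers, and precision, recall, F-measure and accuracy as evaluation parameters. This record does not describe the authors' implementation, so the following is only a minimal sketch of those ideas in plain Python. The function and class names, the whitespace tokenizer, the add-one smoothing and the toy reviews are all assumptions made for illustration; they are not taken from the paper or from the IMDb dataset.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def features(text, orders=(1, 2)):
    """Count n-grams for every order in `orders` (default: unigram + bigram).

    Tokenization is a plain lowercase whitespace split (an assumption).
    """
    tokens = text.lower().split()
    counts = Counter()
    for n in orders:
        counts.update(ngrams(tokens, n))
    return counts


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, docs, labels, orders=(1, 2)):
        self.orders = orders
        self.class_counts = Counter(labels)
        self.feature_counts = {c: Counter() for c in self.class_counts}
        for doc, label in zip(docs, labels):
            self.feature_counts[label].update(features(doc, orders))
        self.vocab = set()
        for counts in self.feature_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, doc):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for c, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            denom = sum(self.feature_counts[c].values()) + len(self.vocab)
            for feat, k in features(doc, self.orders).items():
                if feat in self.vocab:  # skip n-grams unseen in training
                    score += k * math.log((self.feature_counts[c][feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = c, score
        return best_label


def evaluate(y_true, y_pred, positive="pos"):
    """Return precision, recall, F-measure and accuracy for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    accuracy = sum(t == p for t, p in pairs) / len(pairs) if pairs else 0.0
    return precision, recall, f_measure, accuracy


# Hypothetical toy reviews, not drawn from the IMDb dataset.
train_docs = ["great movie loved it", "wonderful acting great plot",
              "terrible movie hated it", "boring plot terrible acting"]
train_labels = ["pos", "pos", "neg", "neg"]
model = NaiveBayes().fit(train_docs, train_labels, orders=(1, 2))
```

Passing `orders=(1,)`, `(2,)`, `(3,)` or a combination such as `(1, 2, 3)` to `fit` switches between the feature sets listed in the summary, and `evaluate` can then be applied to the predictions on a held-out set to compare them.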
ISSN: 0957-4174, 1873-6793
DOI: 10.1016/j.eswa.2016.03.028