An Experimental Study of Supervised Sentiment Analysis Using Gaussian Naïve Bayes

In this millennial generation everyone using technology at higher rates than people from other generations. It means the millennial generation is aware of evolving technology. Many companies are taking chances by receiving customer reviews through applications. In this study, we use customer reviews...

Full description

Saved in:
Bibliographic Details
Published in2018 International Seminar on Application for Technology of Information and Communication pp. 476 - 481
Main Authors Widodo Wijayanto, Unggul, Sarno, Riyanarto
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.09.2018
Subjects
Online AccessGet full text
DOI10.1109/ISEMANTIC.2018.8549788

Cover

Abstract In this millennial generation everyone using technology at higher rates than people from other generations. It means the millennial generation is aware of evolving technology. Many companies are taking chances by receiving customer reviews through applications. In this study, we use customer reviews from Yelp (foods), IMDb (movies) and Amazon (products). The reviews received by the company are numerous. Product management does not have much time to read customer reviews one by one. So, to speed up the reading of customer reviews we were using sentiment analysis. There are many methods that used in sentiment analysis such as supervised sentiment analysis. We used TF-IDF to convert word to features implements the supervised method. Performance of the supervised method depends on the data training quality. So, to improve the accuracy of the results by improving data training quality. The methods used to improve the data training quality in this paper using CHI2 Features Selection and Stopwords. In this study, we use K-folds Cross-validation to get valid results. This study proves the use of Context-based Stopwords can improve the results. Context-based Stopwords enrich the number of Stopwords that removing bias features.
AbstractList In this millennial generation everyone using technology at higher rates than people from other generations. It means the millennial generation is aware of evolving technology. Many companies are taking chances by receiving customer reviews through applications. In this study, we use customer reviews from Yelp (foods), IMDb (movies) and Amazon (products). The reviews received by the company are numerous. Product management does not have much time to read customer reviews one by one. So, to speed up the reading of customer reviews we were using sentiment analysis. There are many methods that used in sentiment analysis such as supervised sentiment analysis. We used TF-IDF to convert word to features implements the supervised method. Performance of the supervised method depends on the data training quality. So, to improve the accuracy of the results by improving data training quality. The methods used to improve the data training quality in this paper using CHI2 Features Selection and Stopwords. In this study, we use K-folds Cross-validation to get valid results. This study proves the use of Context-based Stopwords can improve the results. Context-based Stopwords enrich the number of Stopwords that removing bias features.
Author Widodo Wijayanto, Unggul
Sarno, Riyanarto
Author_xml – sequence: 1
  givenname: Unggul
  surname: Widodo Wijayanto
  fullname: Widodo Wijayanto, Unggul
  organization: Faculty of Technology Information and Communication Institut Teknologi Sepuluh, Informatics Department, Nopember Surabaya, Indonesia
– sequence: 2
  givenname: Riyanarto
  surname: Sarno
  fullname: Sarno, Riyanarto
  organization: Faculty of Technology Information and Communication Institut Teknologi Sepuluh, Informatics Department, Nopember Surabaya, Indonesia
BookMark eNotj11Kw0AURkdQ0NauQJDZQOL8ZTLzGEOtgVrBxOdyTW5kJE5LJylmVS7CjRm0Tx98Bw6cGTn3O4-E3HIWc87sXVEun7JNVeSxYNzEJlE2NeaMzHgijU6V0fqSLEL4YIwJbaSy8oq8ZJ4uv_Z4cJ_oe-ho2Q_NSHctLYfpPbqADS0n9Mdp5qEbgwv0NTj_TlcwhODA0w38fB-R3sOI4ZpctNAFXJx2TqqHZZU_RuvnVZFn68hZ1keN1UoLw2tMWs4tpDWkSQocoJYWGhAKBZN1I9rkrQVVS4Sm1iCEtGxKATknN_9ah4jb_RQAh3F7qpa_3ElSpA
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISEMANTIC.2018.8549788
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 1538674866
9781538674864
EndPage 481
ExternalDocumentID 8549788
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAJGR
AAWTH
ABLEC
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i90t-d9646281ce5f119a7ca757a1aac39ada24e203cd2f5bfa4c3eadc6a22390486a3
IEDL.DBID RIE
IngestDate Wed Aug 27 02:51:39 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-d9646281ce5f119a7ca757a1aac39ada24e203cd2f5bfa4c3eadc6a22390486a3
PageCount 6
ParticipantIDs ieee_primary_8549788
PublicationCentury 2000
PublicationDate 2018-September
PublicationDateYYYYMMDD 2018-09-01
PublicationDate_xml – month: 09
  year: 2018
  text: 2018-September
PublicationDecade 2010
PublicationTitle 2018 International Seminar on Application for Technology of Information and Communication
PublicationTitleAbbrev ISEMANTIC
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0002683493
Score 1.7760675
Snippet In this millennial generation everyone using technology at higher rates than people from other generations. It means the millennial generation is aware of...
SourceID ieee
SourceType Publisher
StartPage 476
SubjectTerms cross-validation
data training
Feature extraction
features selection
Machine learning
Mathematical model
Motion pictures
Seminars
Sentiment analysis
stopwords
supervised
text mining
Training
Title An Experimental Study of Supervised Sentiment Analysis Using Gaussian Naïve Bayes
URI https://ieeexplore.ieee.org/document/8549788
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1da8IwFL2oT9vLPnTsmzzsca22Tdvk0YlOB8qYDnyT2ySFMaii7cD9qf2I_bElbXVu7GFvISEk5Ibc3OSccwFuWCRCJqljRZQzy0igW4zLwEIMqNDxREwLgOwo6D_Th6k_rcDtlgujlMrBZ8o2xfwvX85FZp7Kmsw3-dBYFap6mxVcre17ihswj3KvJAE7Ld4cjLvD9mgy6BgAF7PLzj-yqOROpHcAw83wBXbk1c7SyBbvv5QZ_zu_Q2h80_XI49YRHUFFJcewv6M0WIendkK6O2L-xOAH12Qek3G2MMfFSkkyNsgh0042UiUkRxSQe8xWhmxJRvj58abIHa7VqgGTXnfS6VtlOgXrhbdSS_LA8FAdofzYcTiGAkM_RAdReBwlulS5LU9IN_ajGKnw9B4TAerrAzeyfOidQC2ZJ-oUiL4DCa7jQN1NUB2gce5HYcxRBGHEmORnUDeLM1sUghmzcl3O_66-gD1joAK4dQm1dJmpK-3p0-g6N_EXRMyqQA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1dT8IwFG0QH9QXP8D4bR98dINt3dY-IgFBYTEyE97IXdslxmQQ2UzwT_kj_GO220A0PvjWtGly0za9vbfnnIvQFY24TwWxjIgwamgJdIMy4RkAHuEqnohJAZANvN4TuRu74wq6XnFhpJQ5-Eyaupn_5Yspz3SqrEFdXQ-NbqBN5feJW7C1VhkV26MOYU5JA7aarNEfdYatIOy3NYSLmuX0H3VUcjfS3UXDpQEFeuTFzNLI5O-_tBn_a-Eeqn8T9vDDyhXto4pMDtDOmtZgDT22EtxZk_PHGkG4wNMYj7KZvjDmUuCRxg7pcbwUK8E5pgDfQjbXdEscwOfHm8Q3sJDzOgq7nbDdM8qCCsYza6aGYJ5molpcurFlMfA5-K4PFgB3GAiwibSbDhd27EYxEO6oU8Y9UA8IpoX5wDlE1WSayCOE1SuIMxUJqmmcqBCNMTfyYwbc8yNKBTtGNb04k1khmTEp1-Xk7-5LtNULh4PJoB_cn6JtvVkFjOsMVdPXTJ4rv59GF_l2fwFboa2N
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+International+Seminar+on+Application+for+Technology+of+Information+and+Communication&rft.atitle=An+Experimental+Study+of+Supervised+Sentiment+Analysis+Using+Gaussian+Na%C3%AFve+Bayes&rft.au=Widodo+Wijayanto%2C+Unggul&rft.au=Sarno%2C+Riyanarto&rft.date=2018-09-01&rft.pub=IEEE&rft.spage=476&rft.epage=481&rft_id=info:doi/10.1109%2FISEMANTIC.2018.8549788&rft.externalDocID=8549788