Word frequency and sentiment analysis of twitter messages during Coronavirus pandemic

The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and information sharing. This paper deals by analyzing a vast dataset of Twitter messages related to this disease, starting from January 2020. T...

Full description

Saved in:
Bibliographic Details
Main Authors Rajput, Nikhil Kumar, Grover, Bhavya Ahuja, Rathi, Vipin Kumar, Bansal, Riya
Format Journal Article
LanguageEnglish
Published 08.04.2020
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and information sharing. This paper deals by analyzing a vast dataset of Twitter messages related to this disease, starting from January 2020. Two approaches were used: a statistical analysis of word frequencies and a sentiment analysis to gauge user attitudes. Word frequencies are modeled using unigrams, bigrams, and trigrams, with power law distribution as the fitting model. The validity of the model is confirmed through metrics like Sum of Squared Errors (SSE), R-squared ($R^2$), and Root Mean Squared Error (RMSE). High $R^2$ and low SSE/RMSE values indicate a good fit for the model. Sentiment analysis is conducted to understand the general emotional tone of Twitter users messages. The results reveal that a majority of tweets exhibit neutral sentiment polarity, with only 2.57\% expressing negative polarity.
AbstractList The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and information sharing. This paper deals by analyzing a vast dataset of Twitter messages related to this disease, starting from January 2020. Two approaches were used: a statistical analysis of word frequencies and a sentiment analysis to gauge user attitudes. Word frequencies are modeled using unigrams, bigrams, and trigrams, with power law distribution as the fitting model. The validity of the model is confirmed through metrics like Sum of Squared Errors (SSE), R-squared ($R^2$), and Root Mean Squared Error (RMSE). High $R^2$ and low SSE/RMSE values indicate a good fit for the model. Sentiment analysis is conducted to understand the general emotional tone of Twitter users messages. The results reveal that a majority of tweets exhibit neutral sentiment polarity, with only 2.57\% expressing negative polarity.
Author Bansal, Riya
Rathi, Vipin Kumar
Grover, Bhavya Ahuja
Rajput, Nikhil Kumar
Author_xml – sequence: 1
  givenname: Nikhil Kumar
  surname: Rajput
  fullname: Rajput, Nikhil Kumar
– sequence: 2
  givenname: Bhavya Ahuja
  surname: Grover
  fullname: Grover, Bhavya Ahuja
– sequence: 3
  givenname: Vipin Kumar
  surname: Rathi
  fullname: Rathi, Vipin Kumar
– sequence: 4
  givenname: Riya
  surname: Bansal
  fullname: Bansal, Riya
BackLink https://doi.org/10.48550/arXiv.2004.03925$$DView paper in arXiv
BookMark eNotj81OwzAQhH2AAxQegBN-gYRtHCf2EUX8SZW4FHGM1vG6stQ4xU4KeXtC6WFmNIcZ6btmF2EIxNjdGvJSSQkPGH_8MS8AyhyELuQV-_gcouUu0tdEoZs5BssThdH3iy0N93PyiQ-Oj99-HCnynlLCHSVup-jDjjdDHAIefZwSPyxz6n13wy4d7hPdnnPFts9P2-Y127y_vDWPmwyrWmaIFgGkFMY5YXHRmgqgCsAah1QqBwZrTZ2utNKCTKeEUFIbDbKuKy1W7P7_9sTVHqLvMc7tH1974hO_0OhO7A
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2004.03925
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2004_03925
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a675-aada00553bff3daf3d1e20e600dbfae48f0ba79ec969893ebc833859b90577693
IEDL.DBID GOX
IngestDate Wed Jun 05 12:17:34 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a675-aada00553bff3daf3d1e20e600dbfae48f0ba79ec969893ebc833859b90577693
OpenAccessLink https://arxiv.org/abs/2004.03925
ParticipantIDs arxiv_primary_2004_03925
PublicationCentury 2000
PublicationDate 2020-04-08
PublicationDateYYYYMMDD 2020-04-08
PublicationDate_xml – month: 04
  year: 2020
  text: 2020-04-08
  day: 08
PublicationDecade 2020
PublicationYear 2020
Score 1.7649463
SecondaryResourceType preprint
Snippet The COVID-19 epidemic has had a great impact on social media conversation, especially on sites like Twitter, which has emerged as a hub for public reaction and...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computation and Language
Computer Science - Information Retrieval
Computer Science - Social and Information Networks
Title Word frequency and sentiment analysis of twitter messages during Coronavirus pandemic
URI https://arxiv.org/abs/2004.03925
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV09T8MwED21nVgQCFD5lAfWQOo4TTyiilIhAUsrulXnxEYZ-qH0g_LvubODYGFwJDue7oZ7L7l7D-A27yFxHUZueYKRQrQRe1tFCVpUJsPYeEuWl9f-aKKep-m0BeJnFgbrfbUL-sBmfc8pvIuphKdtaEvJLVtPb9Pwc9JLcTX3f-8RxvRHf4rE8AgOG3QnHkI6jqFlFycweSeCJ1wdmpa_BHF3wSM_XlefdkEVRCyd2HxWPFwj5uxL8mHXIgwRigHLDOCuqrdrseKvvvOqOIXx8HE8GEWNm0GEBMojxBJZ8CoxziUl0upZGVvCG6VxaFXuYoOZtoVmS8fEmiIn9phqowlRsWHhGXQWy4XtgogLiYk0KnNpqaTVGgk3xKVWPXrk_ewcuj4Gs1UQrGCrSTXz4bn4_9UlHEjmktyVkl9BZ1Nv7TUV3I258VH_BqHpgfU
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Word+frequency+and+sentiment+analysis+of+twitter+messages+during+Coronavirus+pandemic&rft.au=Rajput%2C+Nikhil+Kumar&rft.au=Grover%2C+Bhavya+Ahuja&rft.au=Rathi%2C+Vipin+Kumar&rft.au=Bansal%2C+Riya&rft.date=2020-04-08&rft_id=info:doi/10.48550%2Farxiv.2004.03925&rft.externalDocID=2004_03925