Fighting the COVID-19 Infodemic in News Articles and False Publications: The NeoNet Text Classifier, a Supervised Machine Learning Algorithm

The spread of the Coronavirus pandemic has been accompanied by an infodemic. The false information that is embedded in the infodemic affects people’s ability to have access to safety information and follow proper procedures to mitigate the risks. This research aims to target the falsehood part of th...

Full description

Saved in:

Bibliographic Details
Published in	Applied sciences Vol. 11; no. 16; p. 7265
Main Authors	Abdeen, Mohammad A. R., Hamed, Ahmed Abdeen, Wu, Xindong
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.08.2021
Subjects	Algorithms Artificial intelligence Coronaviruses COVID-19 COVID-19 infodemic Documents Learning algorithms Machine learning misinformation network training modes Neural networks Social networks supervised learning text classification TF-IDF features China
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The spread of the Coronavirus pandemic has been accompanied by an infodemic. The false information that is embedded in the infodemic affects people’s ability to have access to safety information and follow proper procedures to mitigate the risks. This research aims to target the falsehood part of the infodemic, which prominently proliferates in news articles and false medical publications. Here, we present NeoNet, a novel supervised machine learning algorithm that analyzes the content of a document (news article, a medical publication) and assigns a label to it. The algorithm was trained by Term Frequency Inverse Document Frequency (TF-IDF) bigram features, which contribute a network training model. The algorithm was tested on two different real-world datasets from the CBC news network and COVID-19 publications. In five different fold comparisons, the algorithm predicted a label of an article with a precision of 97–99%. When compared with prominent algorithms such as Neural Networks, SVM, and Random Forests NeoNet surpassed them. The analysis highlighted the promise of NeoNet in detecting disputed online contents, which may contribute negatively to the COVID-19 pandemic.
ISSN:	2076-3417 2076-3417
DOI:	10.3390/app11167265