Comparative Study of Machine Learning Algorithms and Text Vectorization Methods for Fake News Detection

The detection of fake news is a crucial task in today's society, given the widespread use of social media and online platforms. In this study, we investigate the application of Machine Learning (ML) algorithms for the detection of fake news. We consider two different datasets of categorized new...

Full description

Saved in:

Bibliographic Details
Published in	2023 14th International Conference on Information, Intelligence, Systems & Applications (IISA) pp. 1 - 8
Main Authors	Kanavos, Andreas, Karamitsos, Ioannis, Mohasseb, Alaa, Gerogiannis, Vassilis C.
Format	Conference Proceeding
Language	English
Published	IEEE 10.07.2023
Subjects	Classification algorithms Fake news Fake News Detection Information Retrieval Machine Learning Machine learning algorithms Naive Bayes methods Social networking (online) Support vector machines Task analysis Text Mining
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The detection of fake news is a crucial task in today's society, given the widespread use of social media and online platforms. In this study, we investigate the application of Machine Learning (ML) algorithms for the detection of fake news. We consider two different datasets of categorized news articles of various sizes and apply various ML algorithms, along with two methods of text vectorization. Specifically, we examine Bag of Words and Tf-Idf, with the use of stemming and with different n-gram values. The resulting vectors are processed by Naive Bayes algorithms, Linear algorithms, Support Vector Machines, and Random Forest classifiers. F1-score and computational time for each algorithm-vectorization combination were recorded. Our results have shown that Linear Algorithms and Support Vector Machines combined with Tf-Idf vectors and n-gram value of (1,2) produced the highest accuracies, with an F1-score up to 96.8%.
DOI:	10.1109/IISA59645.2023.10345953