Comparative Study of Machine Learning Algorithms and Text Vectorization Methods for Fake News Detection

The detection of fake news is a crucial task in today's society, given the widespread use of social media and online platforms. In this study, we investigate the application of Machine Learning (ML) algorithms for the detection of fake news. We consider two different datasets of categorized new...

Full description

Saved in:
Bibliographic Details
Published in2023 14th International Conference on Information, Intelligence, Systems & Applications (IISA) pp. 1 - 8
Main Authors Kanavos, Andreas, Karamitsos, Ioannis, Mohasseb, Alaa, Gerogiannis, Vassilis C.
Format Conference Proceeding
LanguageEnglish
Published IEEE 10.07.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The detection of fake news is a crucial task in today's society, given the widespread use of social media and online platforms. In this study, we investigate the application of Machine Learning (ML) algorithms for the detection of fake news. We consider two different datasets of categorized news articles of various sizes and apply various ML algorithms, along with two methods of text vectorization. Specifically, we examine Bag of Words and Tf-Idf, with the use of stemming and with different n-gram values. The resulting vectors are processed by Naive Bayes algorithms, Linear algorithms, Support Vector Machines, and Random Forest classifiers. F1-score and computational time for each algorithm-vectorization combination were recorded. Our results have shown that Linear Algorithms and Support Vector Machines combined with Tf-Idf vectors and n-gram value of (1,2) produced the highest accuracies, with an F1-score up to 96.8%.
DOI:10.1109/IISA59645.2023.10345953