Optimizing social media analytics with the data quality enhancement and analytics framework for superior data quality

his paper introduces the data quality enhancement and analytics (DQEA) framework to enhance data quality in social media analytics through machine learning (ML) algorithms. The efficacy of the framework is validated through features tested against human coders on Amazon Mechanical Turk, achieving an...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of reconfigurable and embedded systems Vol. 14; no. 2; p. 472
Main Authors Karthick, B., Meyyappan, T.
Format Journal Article
LanguageEnglish
Published 01.07.2025
Online AccessGet full text

Cover

Loading…
More Information
Summary:his paper introduces the data quality enhancement and analytics (DQEA) framework to enhance data quality in social media analytics through machine learning (ML) algorithms. The efficacy of the framework is validated through features tested against human coders on Amazon Mechanical Turk, achieving an inter-coder reliability score of 0.85, indicating high agreement. Furthermore, two case studies with a large social media dataset from Tumblr were conducted to demonstrate the effectiveness of the proposed content features. In the first case study, the DQEA framework reduced data noise by 30% and bias by 25%, while increasing completeness by 20%. In the second case study, the framework improved data consistency by 35% and overall data quality score by 28%. Comparative analysis with state-of-the-art models, including random forest and support vector machines (SVM), showed significant improvements in data reliability and decision-making accuracy. Specifically, the DQEA framework outperformed the random forest model by 15% in accuracy and 20% in true positive rate, and the SVM model by 10% in error rate reduction and 18% in reliability. The results underscore the potential of advanced data analytics tools in transforming social media data into a valuable asset for organizations, highlighting the practical implications and future research directions in this domain.
ISSN:2089-4864
2722-2608
DOI:10.11591/ijres.v14.i2.pp472-480