Sara Detection on Social Media Using Deep Learning Algorithm Development

Social media has become a key platform for disseminating information and opinions, particularly in Indonesia, where SARA (Ethnicity, Religion, Race, and Intergroup) issues can fuel social tensions. To address this, developing an automated system to detect and classify harmful content is essential. T...

Full description

Saved in:

Bibliographic Details
Published in	Journal of Applied Engineering and Technological Science (Online) Vol. 6; no. 1; pp. 225 - 237
Main Authors	Anam, M. Khairul, Van FC, Lucky Lhaura, Hamdani, Hamdani, Rahmaddeni, Rahmaddeni, Junadhi, Junadhi, Firdaus, Muhammad Bambang, Syahputra, Irwanda, Irawan, Yuda
Format	Journal Article
Language	English
Published	Yayasan Pendidikan Riset dan Pengembangan Intelektual (YRPI) 15.12.2024
Subjects	Deep Learning SARA Comments SARA Detection SMOTE Social Media Classification
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Social media has become a key platform for disseminating information and opinions, particularly in Indonesia, where SARA (Ethnicity, Religion, Race, and Intergroup) issues can fuel social tensions. To address this, developing an automated system to detect and classify harmful content is essential. This study develops a deep learning model using Convolutional Neural Network (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) to detect SARA-related comments on Twitter. The method involves data collection through web scraping, followed by cleaning, manual labeling, and text preprocessing. To address data imbalance, SMOTE (Synthetic Minority Over-sampling Technique) is applied, while early stopping prevents overfitting. Model performance is evaluated using precision, recall, and F1-score. The results demonstrate that SMOTE significantly improves model performance, particularly in detecting minority-class SARA comments. CNN+SMOTE achieves a accuracy of 93%, and BiLSTM+SMOTE records a recall of 88%, effectively capturing patterns in SARA and non-SARA data. With SMOTE and early stopping, the model successfully manages class imbalance and reduces overfitting. This research supports efforts to curtail hate speech on social media, especially in the Indonesian context, where SARA-related issues often dominate public discourse.
ISSN:	2715-6087 2715-6079
DOI:	10.37385/jaets.v6i1.5390