Unmasking Toxicity: A Comprehensive Analysis of Hate Speech Detection in Banglish

As the digital landscape expands, the rise of online hate speech presents a pressing challenge, necessitating sophis-ticated tools for effective detection and mitigation. This project focuses on the intricate linguistic landscape of Banglish a hybrid language amalgamating Bengali and English strivin...

Full description

Saved in:

Bibliographic Details
Published in	International Conference on Electrical Engineering and Information & Communication Technology pp. 963 - 968
Main Authors	Islam, Md. Hasibul, Farzana, Kaniz, Khalil, Ibrahim, Ara, Shaneen, Shazid, Md.Ruhul Amin, Kabir Mehedi, Md Humaion
Format	Conference Proceeding
Language	English
Published	IEEE 02.05.2024
Subjects	Adaptation models Banglish computational linguistics content moderation deep learning ethical AI Ethics Hate speech hate speech detection Machine learning algorithms multilingualism Sentiment analysis Support vector machines Toxicology
Online Access	Get full text
ISSN	2769-5700
DOI	10.1109/ICEEICT62016.2024.10534362

Cover

Loading…

More Information
Summary:	As the digital landscape expands, the rise of online hate speech presents a pressing challenge, necessitating sophis-ticated tools for effective detection and mitigation. This project focuses on the intricate linguistic landscape of Banglish a hybrid language amalgamating Bengali and English striving to develop robust models tailored to its unique characteristics. The dataset, comprising 5000 Banglish comments categorized into various hate speech types, serves as the foundation for model exploration. Our approach spans a wide variety of models, including traditional machine learning (SVM, Logistic Regression,random forest), advanced deep learning architectures and innovative hybrid models (CNN+BiLSTM). Approaches for feature extraction such word embedding, TF-IDF, and Bag-of-Words and sentiment analysis scores are adapted to the nuances of Banglish. Ethical considerations guide our development, addressing algorithmic bias and user rights. The experimental results provide a nuanced understanding of model performance, in- cluding accuracy (90%), precision, recall, and F1 score. Insights derived from these analyses contribute to the ongoing refinement of hate speech detection methodologies, advancing the field of computational linguistics and ethical artificial intelligence.
ISSN:	2769-5700
DOI:	10.1109/ICEEICT62016.2024.10534362