Transformer-based Arabic Offensive Speech Detection

The prevalence of social media platforms prompted detecting any language that is intended to harm or intimidate another person or group of people in online posts and comments. On Twitter, for instance, users are susceptible to cyberbullying and hate speech, which may develop into physical and psycho...

Full description

Saved in:

Bibliographic Details
Published in	2023 International Conference on Emerging Smart Computing and Informatics (ESCI) pp. 1 - 6
Main Authors	Al-Dabet, Saja, ElMassry, Ahmed, Alomar, Ban, Alshamsi, Abdullah
Format	Conference Proceeding
Language	English
Published	IEEE 01.03.2023
Subjects	Arabic dataset Blogs Conferences Cyberbullying Hate speech Offensive Speech Psychology Speech recognition Transformer Voice activity detection
Online Access	Get full text
DOI	10.1109/ESCI56872.2023.10100134

Cover

More Information
Summary:	The prevalence of social media platforms prompted detecting any language that is intended to harm or intimidate another person or group of people in online posts and comments. On Twitter, for instance, users are susceptible to cyberbullying and hate speech, which may develop into physical and psychological violence. A transformer-based approach is presented in this study to address the offensive speech detection issue. This model employs versions of the CAMeLBERT model and is validated using a mixture of four benchmark Twitter Arabic datasets annotated for hate speech detection task, including the (OSACT5 2022) workshop shared task dataset. The presented model was capable of recognizing Arabic tweets containing offensive speech with 87.15 % accuracy and 83.6 % F1 score.
DOI:	10.1109/ESCI56872.2023.10100134