Transformer-based Arabic Offensive Speech Detection

The prevalence of social media platforms prompted detecting any language that is intended to harm or intimidate another person or group of people in online posts and comments. On Twitter, for instance, users are susceptible to cyberbullying and hate speech, which may develop into physical and psycho...

Full description

Saved in:
Bibliographic Details
Published in2023 International Conference on Emerging Smart Computing and Informatics (ESCI) pp. 1 - 6
Main Authors Al-Dabet, Saja, ElMassry, Ahmed, Alomar, Ban, Alshamsi, Abdullah
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2023
Subjects
Online AccessGet full text
DOI10.1109/ESCI56872.2023.10100134

Cover

More Information
Summary:The prevalence of social media platforms prompted detecting any language that is intended to harm or intimidate another person or group of people in online posts and comments. On Twitter, for instance, users are susceptible to cyberbullying and hate speech, which may develop into physical and psychological violence. A transformer-based approach is presented in this study to address the offensive speech detection issue. This model employs versions of the CAMeLBERT model and is validated using a mixture of four benchmark Twitter Arabic datasets annotated for hate speech detection task, including the (OSACT5 2022) workshop shared task dataset. The presented model was capable of recognizing Arabic tweets containing offensive speech with 87.15 % accuracy and 83.6 % F1 score.
DOI:10.1109/ESCI56872.2023.10100134