Deep Learning Models for Multilingual Hate Speech Detection (2004.06465v3)

Published 14 Apr 2020 in cs.SI and cs.CL

Abstract: Hate speech detection is a challenging problem with most of the datasets available in only one language: English. In this paper, we conduct a large scale analysis of multilingual hate speech in 9 languages from 16 different sources. We observe that in low resource setting, simple models such as LASER embedding with logistic regression performs the best, while in high resource setting BERT based models perform better. In case of zero-shot classification, languages such as Italian and Portuguese achieve good results. Our proposed framework could be used as an efficient solution for low-resource languages. These models could also act as good baselines for future multilingual hate speech detection tasks. We have made our code and experimental settings public for other researchers at https://github.com/punyajoy/DE-LIMIT.

Citations (137)

View on Semantic Scholar

Summary

The paper demonstrates that deep learning models enhance hate speech detection through extensive monolingual and cross-lingual experiments.
It compares various approaches, including MUSE+CNN-GRU, Translation+BERT, LASER+LR, and mBERT, revealing strengths in low- and high-resource scenarios.
The study highlights promising zero-shot performance in languages like Italian and Portuguese, suggesting a path for more inclusive moderation tools.

Overview of Deep Learning Models for Multilingual Hate Speech Detection

This paper, presented by Aluru et al., undertakes a comprehensive paper of multilingual hate speech detection using deep learning methodologies. Given the critical issue of hate speech dissemination via online platforms, the importance of accurately detecting these harmful narratives across multiple languages has garnered significant attention in recent computational linguistics research. The authors thus concentrate on analyzing the efficacy of state-of-the-art deep learning models in detecting hate speech across nine different languages utilizing 16 diverse datasets.

Methodology

The research delineates two experimental scenarios for model evaluation: monolingual and multilingual configurations. Monolingual settings train and test models using data from the same language, while multilingual setups involve training with all languages except one and testing on the excluded language. This dual approach facilitates an understanding of cross-lingual transfer capabilities and model performance variability contingent on language.

Several models constitute the nucleus of this paper, specifically:

MUSE + CNN-GRU: This configuration uses MUSE embeddings passed through a Convolutional Neural Network combined with a Gated Recurrent Unit for classification.
Translation + BERT: Beyond leveraging Google's neural machine translation to align non-English languages to English, this model fine-tunes BERT for predictions.
LASER + LR: By extracting sentence embeddings via LASER, these are fed into a Logistic Regression classifier.
mBERT: A multilingual derivative of BERT that directly processes input without translation, learning contextual nuances across languages.

Results and Observations

Through rigorous experimental analysis, several noteworthy findings emerged:

Low-Resource Effectiveness: The LASER + LR model demonstrated superior performance in low-resource environments across most languages. This suggests sentence embeddings combined with simple classifiers can effectively mitigate data paucity.
High-Resource Efficiency: BERT-based models significantly outperformed others when abundant datasets were available, indicating their potency in effectively scaling with linguistic resources.
Zero-Shot Learning Potency: Some languages, such as Italian and Portuguese, achieved commendable results under zero-shot learning configurations, evidencing robust cross-lingual capabilities intrinsic to multilingual models.

Despite these insights, challenges persist, notably related to language-specific annotation biases and model dependency on lexical markers which may not always correlate aptly with hate speech semantics.

Implications and Future Directions

The implications of this paper resonate across both practical and theoretical dimensions. Practically, the proposed multilingual frameworks furnish a foundation for deploying hate speech detectors in low-resource languages, thus aiding platforms like Facebook in broadening their moderation capabilities beyond prevalent languages like English or Mandarin. Theoretically, these findings propel future exploration into fine-grained, context-aware models that can transcend superficial lexical indicators and more precisely interpret the socio-linguistic context of hateful narratives.

In concluding, Aluru et al.'s work underscores the heterogeneous challenges in multilingual hate speech detection, advancing the field through an empirical baseline that fellow researchers are encouraged to extend and enhance. As multilingual AI continues to evolve, further explorations might encompass more nuanced error analysis, improved language translations, and integration of socio-cultural factors influencing hate speech retention across digital networks.

PDF Markdown

Related Papers

GitHub

GitHub - hate-alert/DE-LIMIT: DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. (109 stars)