MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection (2401.06526v1)
Abstract: Hate speech represents a pervasive and detrimental form of online discourse, often manifested through an array of slurs, from hateful tweets to defamatory posts. As such speech proliferates, it connects people globally and poses significant social, psychological, and occasionally physical threats to targeted individuals and communities. Current computational linguistic approaches for tackling this phenomenon rely on labelled social media datasets for training. For unifying efforts, our study advances in the critical need for a comprehensive meta-collection, advocating for an extensive dataset to help counteract this problem effectively. We scrutinized over 60 datasets, selectively integrating those pertinent into MetaHate. This paper offers a detailed examination of existing collections, highlighting their strengths and limitations. Our findings contribute to a deeper understanding of the existing datasets, paving the way for training more robust and adaptable models. These enhanced models are essential for effectively combating the dynamic and complex nature of hate speech in the digital realm.
- Pinpointing Fine-Grained Relationships between Hateful Tweets and Replies. Proceedings of the AAAI 2022, 36(10): 10418–10426.
- SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter. In Proceedings of the 13th International Workshop on Semantic Evaluation, 54–63. ACL.
- The Pushshift Reddit Dataset. Proceedings of the ICWSM 2020, 14: 830–839.
- Developing a Multilingual Annotated Corpus of Misogyny and Aggression. In Proceedings of the TRAC 2020, 158–168. ELRA.
- Using the Reddit Corpus for Cyberbully Detection, 180–189. Springer. ISBN 9783319754178.
- HateMM: A Multi-Modal Dataset for Hate Video Classification. Proceedings of the ICWSM 2023, 17: 1014–1023.
- Automated Hate Speech Detection and the Problem of Offensive Language. Proceedings of the ICWSM 2017, 11(1): 512–515.
- Hate Speech Dataset from a White Supremacy Forum. In Proceedings of the ALW2 2018. ACL.
- Hate Lingo: A Target-Based Linguistic Analysis of Hate Speech in Social Media. Proceedings of the ICWSM 2018, 12(1).
- Peer to Peer Hate: Hate Speech Instigators and Their Targets. Proceedings of the ICWSM 2018, 12(1).
- Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI), 59–66. Accademia University Press. ISBN 9788831978699.
- A Hierarchically-Labeled Portuguese Hate Speech Dataset. In Proceedings of the ALW 2019, 94–104. ACL.
- Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior. Proceedings of the ICWSM 2018, 12(1).
- MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement. Proceedings of the ICWSM 2020, 14: 209–216.
- A Large Labeled Corpus for Online Harassment Research. In Proceedings of the ACM WebSci 2017. ACM.
- Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection. In Proceedings of the WASSA 2021, 171–180. ACL.
- Hateful Comment Detection and Hate Target Type Prediction for Video Comments. In Proceedings of the CIKM 2023, CIKM ’23, 3923–3927. ACM. ISBN 9798400701245.
- Auditing Elon Musk’s Impact on Hate Speech and Bots. In Proceedings of the ICWSM 2023, 1133–1137. AAAI.
- When does a compliment become sexist? Analysis and classification of ambivalent sexism using twitter data. In Proceedings of the Second Workshop on NLP + CSS, 7–16. ACL.
- A systematic review on hate speech among children and adolescents: definitions, prevalence, and overlap with related phenomena. Trauma, violence, & abuse, 24(4): 2598–2615.
- Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale. Language Resources and Evaluation, 56(1): 79–108.
- Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application.
- Aggression-annotated Corpus of Hindi-English Code-mixed Data. In Proceedings of the LREC 2018. Miyazaki, Japan: ELRA.
- Towards a Comprehensive Taxonomy and Large-Scale Annotated Corpus for Online Slur Usage. In Proceedings of the WOAH 2020, 138–149. Online: ACL.
- The FRENK Datasets of Socially Unacceptable Discourse in Slovene and English. In Text, Speech, and Dialogue, 103–114. Springer. ISBN 978-3-030-27947-9.
- Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German. In Proceedings of the FIRE 2020, FIRE 2020. ACM.
- Overview of the HASOC track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages. In Proceedings of the FIRE 2019, FIRE ’19. ACM.
- Spread of Hate Speech in Online Social Media. In Proceedings of the ACM WebSci 2019, WebSci ’19. ACM.
- HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection. In Proceedings of the AAAI 2020.
- Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech. In Proceedings of the FIRE 2021, FIRE 2021. ACM.
- A curated dataset for hate speech detection on social media text. Data in Brief, 46: 108832.
- Crowdsourcing a Word-Emotion Association Lexicon. Computational Intelligence, 29(3): 436–465.
- ETHOS: a multi-label hate speech detection dataset. Complex & Intelligent Systems, 8(6): 4663–4678.
- A Measurement Study of Hate Speech in Social Media. In Proceedings of the ACM HT, HT ’17, 85–94. ACM. ISBN 9781450347082.
- Nations, U. 2023. What is hate speech? Accessed: 15/11/2023.
- Multilingual and Multi-Aspect Hate Speech Analysis. In Proceedings of the EMNLP-IJCNLP 2019, 4675–4684. ACL.
- Toxicity Detection: Does Context Really Matter?
- SemEval-2021 Task 5: Toxic Spans Detection. In Proceedings of the SemEval 2021, 59–69. ACL.
- Detecting and Monitoring Hate Speech in Twitter. Sensors, 19(21).
- Plutchik, R. 1980. A general psychoevolutionary theory of emotion. Theories of emotion, 1: 3–31.
- Resources and benchmark corpora for hate speech detection: a systematic review. Language Resources and Evaluation, 55(2): 477–523.
- A Benchmark Dataset for Learning to Intervene in Online Hate Speech. In Proceedings of the EMNLP-IJCNLP 2019, 4755–4764. ACL.
- HateCheck: Functional Tests for Hate Speech Detection Models. In Proceedings of the ACL-IJCNLP, 41–58. ACL.
- The Measuring Hate Speech Corpus: Leveraging Rasch Measurement Theory for Data Perspectivism. In Proceedings of the LREC 2022, 83–94. ELRA.
- Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media. Proceedings of the ICWSM 2018, 12(1).
- “Call me sexist, but…” : Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples. Proceedings of the ICWSM 2021, 15: 573–584.
- HaSpeeDe 2 @ EVALITA2020: Overview of the EVALITA 2020 Hate Speech Detection Task, 93–101. Accademia University Press.
- An Italian Twitter Corpus of Hate Speech against Immigrants. In Proceedings of the LREC 2018. ELRA.
- Analyzing the Targets of Hate in Online Social Media. Proceedings of the ICWSM 2021, 10(1): 687–690.
- Spertus, E. 1997. Smokey: Automatic Recognition of Hostile Messages. In AAAI/IAAI.
- Large-Scale Hate Speech Detection with Cross-Domain Transfer. In Proceedings of the LREC 2022, 2215–2225. ELRA.
- Visualizing Data using t-SNE. Journal of Machine Learning Research, 9(86): 2579–2605.
- Directions in abusive language training data, a systematic review: Garbage in, garbage out. PLOS ONE, 15(12): e0243300.
- Introducing CAD: the Contextual Abuse Dataset. In Proceedings of the NAACL 2021, 2289–2303. ACL.
- Vogels, E. A. 2021. The state of online harassment.
- Waseem, Z. 2016. Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter. In Proceedings of the NLP + CSS 2016, 138–142. ACL.
- Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In Proceedings of the NAACL 2016. ACL.
- Ex Machina: Personal Attacks Seen at Scale.
- Predicting the Type and Target of Offensive Posts in Social Media. In Proceedings of the NAACL 2019, 1415–1420. ACL.