Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 47 tok/s
Gemini 2.5 Pro 44 tok/s Pro
GPT-5 Medium 13 tok/s Pro
GPT-5 High 12 tok/s Pro
GPT-4o 64 tok/s Pro
Kimi K2 160 tok/s Pro
GPT OSS 120B 452 tok/s Pro
Claude Sonnet 4 37 tok/s Pro
2000 character limit reached

nlpBDpatriots at BLP-2023 Task 1: A Two-Step Classification for Violence Inciting Text Detection in Bangla (2311.15029v1)

Published 25 Nov 2023 in cs.CL

Abstract: In this paper, we discuss the nlpBDpatriots entry to the shared task on Violence Inciting Text Detection (VITD) organized as part of the first workshop on Bangla Language Processing (BLP) co-located with EMNLP. The aim of this task is to identify and classify the violent threats, that provoke further unlawful violent acts. Our best-performing approach for the task is two-step classification using back translation and multilinguality which ranked 6th out of 27 teams with a macro F1 score of 0.74.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (21)
  1. Overview of mex-a3t at iberlef 2019: Authorship and aggressiveness analysis in mexican spanish tweets. In Proceedings of IberLEF.
  2. Unsupervised cross-lingual representation learning at scale. In Proceedings of ACL.
  3. Hate speech and offensive language detection in bengali. In Proceedings of AACL.
  4. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL.
  5. Google. 2021. Google cloud translation api documentation. Accessed: 2023-08-28.
  6. BanglaHateBERT: BERT for abusive language detection in Bengali. In Proceedings ResTUP.
  7. Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730.
  8. Bangla-bert: transformer-based efficient model for transfer learning and language understanding. IEEE Access, 10:91855–91870.
  9. Benchmarking Aggression Identification in Social Media. In Proceedings of TRAC.
  10. Evaluating aggression identification in social media. In Proceedings of TRAC.
  11. Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech. In Proceedings of FIRE.
  12. OpenAI. 2023. Gpt-3.5 turbo fine-tuning and api updates. Accessed: 2023-08-28.
  13. Bengali hate speech detection in public facebook pages. In Proceedings of ICISET.
  14. Bd-shs: A benchmark dataset for learning to detect online bangla hate speech in different social contexts. In Proceedings of LREC.
  15. Blp-2023 task 1: Violence inciting text detection (vitd). In Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023).
  16. Vio-lens: A novel dataset of annotated social network posts leading to different forms of communal violence and its evaluation. In Proceedings of BLP.
  17. L-boost: Identifying offensive texts from social media post in bengali. Ieee Access, 9:164681–164699.
  18. Vicarious offense and noise audit of offensive speech classifiers: Unifying human and machine disagreement on what is offensive. In Proceedings of EMNLP.
  19. SemEval-2019 task 6: Identifying and categorizing offensive language in social media (OffensEval). In Proceedings of SemEval.
  20. SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). In Proceedings of SemEval.
  21. Improving zero-shot cross-lingual hate speech detection with pseudo-label fine-tuning of transformer language models. In Proceedings of ICWSM.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.