The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content (2405.11030v1)

Published 17 May 2024 in cs.CL

Abstract: As social media has become a predominant mode of communication globally, the rise of abusive content threatens to undermine civil discourse. Recognizing the critical nature of this issue, a significant body of research has been dedicated to developing LLMs that can detect various types of online abuse, e.g., hate speech, cyberbullying. However, there exists a notable disconnect between platform policies, which often consider the author's intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts to capture intent. This paper examines the role of intent in content moderation systems. We review state-of-the-art detection models and benchmark training datasets for online abuse to assess their awareness and ability to capture intent. We propose strategic changes to the design and development of automated detection and moderation systems to improve alignment with ethical and policy conceptualizations of abuse.

References (273)
  1. Automatic hate speech detection using machine learning: A comparative study. International Journal of Advanced Computer Science and Applications, 11(8).
  2. Comparative study for predicting the severity of cyberbullying across multiple social media platforms. In 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), 871–877. IEEE.
  3. Abusive Comment Detection in Social Media with Bidirectional LSTM Model. In 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT), 1368–1373. IEEE.
  4. Cyber harm: concepts, taxonomy and measurement. Saïd Business School WP, 23.
  5. Performance analysis of transformer-based architectures and their ensembles to detect trait-based cyberbullying. Social Network Analysis and Mining, 12(1): 99.
  6. Am i being bullied on social media? an ensemble approach to categorize cyberbullying. In 2021 IEEE international conference on big data (Big data), 2442–2453. IEEE.
  7. Q-bully: a reinforcement learning based cyberbullying detection framework. In 2020 International conference for emerging technology (INCET), 1–6. IEEE.
  8. Cyberbullying Detection and Classification in Social Media Texts Using Machine Learning Techniques. In International Conference on Computer Science, Engineering and Education Applications, 440–449. Springer.
  9. Cybercrime detection in online communications: The experimental case of cyberbullying detection in the Twitter network. Computers in Human Behavior, 63: 433–443.
  10. Al Mazari, A. 2013. Cyber-bullying taxonomies: Definition, forms, consequences and mitigation strategies. In 2013 5th International Conference on Computer Science and Information Technology, 126–133. IEEE.
  11. Pinpointing fine-grained relationships between hateful tweets and replies. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, 10418–10426.
  12. Not All Counterhate Tweets Elicit the Same Replies: A Fine-Grained Analysis. In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (* SEM 2023), 71–88.
  13. Cyberbullying detection: an overview. In 2018 Cyber Resilience Conference (CRC), 1–3. IEEE.
  14. Cyberbullying Detection Approaches: A Review. In 2023 5th International Conference on Inventive Research in Computing Applications (ICIRCA), 1310–1316. IEEE.
  15. A literature review of textual hate speech detection methods and datasets. Information, 13(6): 273.
  16. Smart detection of offensive words in social media using the soundex algorithm and permuterm index. International Journal of Electrical and Computer Engineering (IJECE), 11(5): 4431–4438.
  17. Aggressors and victims in bullying and cyberbullying: A study of personality profiles using the five-factor model. The Spanish journal of psychology, 20: E76.
  18. TheNorth at HASOC 2019: Hate Speech Detection in Social Media Data. In FIRE (Working Notes), 293–299.
  19. Hate speech detection using transformer ensembles on the hasoc dataset. In International conference on speech and computer, 13–21. Springer.
  20. A multichannel deep learning framework for cyberbullying detection on social media. Electronics, 10(21): 2664.
  21. A review on abusive content automatic detection: approaches, challenges and opportunities. PeerJ Computer Science, 8: e1142.
  22. Alrehili, A. 2019. Automatic hate speech detection on social media: A brief survey. In 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA), 1–6. IEEE.
  23. A Survey of Cyberbullying Detection and Performance: Its Impact in Social Media Using Artificial Intelligence. SN Computer Science, 4(6): 859.
  24. Ana, O. S. 1999. Like an animal I was treated’: Anti-immigrant metaphor in US public discourse. Discourse & society, 10(2): 191–224.
  25. Analysis of online toxicity detection using machine learning approaches. In International Conference on Artificial Intelligence and Sustainable Engineering: Select Proceedings of AISE 2020, Volume 1, 381–392. Springer.
  26. Hate speech, toxicity detection in online social media: a recent survey of state of the art and opportunities. International Journal of Information Security, 23(1): 577–608.
  27. Robust hate speech detection in social media: A cross-dataset empirical evaluation. arXiv preprint arXiv:2307.01680.
  28. Revisiting contextual toxicity detection in conversations. ACM Journal of Data and Information Quality, 15(1): 1–22.
  29. Automatic identification and classification of misogynistic language on twitter. In Natural Language Processing and Information Systems: 23rd International Conference on Applications of Natural Language to Information Systems, NLDB 2018, Paris, France, June 13-15, 2018, Proceedings 23, 57–64. Springer.
  30. Detecting harmful content on online platforms: what platforms need vs. where research efforts go. ACM Computing Surveys, 56(3): 1–17.
  31. Using NLP Techniques for Cyberbullying Tweet Recognition. In 2023 3rd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA), 1231–1236. IEEE.
  32. Abusive language detection in youtube comments leveraging replies as conversational context. PeerJ Computer Science, 7: e742.
  33. Angrybert: Joint learning target and emotion for hate speech detection. In Pacific-Asia conference on knowledge discovery and data mining, 701–713. Springer.
  34. We fear for our lives: Offline and online experiences of anti-Muslim hostility.
  35. Interpretable and high-performance hate and offensive speech detection. In International Conference on Human-Computer Interaction, 233–244. Springer.
  36. Improving cyberbullying detection using Twitter users’ psychological features and machine learning. Computers & Security, 90: 101710.
  37. A unified taxonomy of harmful content. In Proceedings of the fourth workshop on online abuse and harms, 125–137.
  38. TweetEval: Unified benchmark and comparative evaluation for tweet classification. arXiv preprint arXiv:2010.12421.
  39. Semeval-2019 task 5: Multilingual detection of hate speech against immigrants and women in twitter. In Proceedings of the 13th international workshop on semantic evaluation, 54–63.
  40. Bias, subjectivity and perspectives in natural language processing. Frontiers in Artificial Intelligence, 5: 926435.
  41. Metaheuristic ant lion and moth flame optimization-based novel approach for automatic detection of hate speech in online social networks. IEEE Access, 9: 110047–110062.
  42. Baydogan, C.; et al. 2022. Deep-Cov19-Hate: A textual-based novel approach for automatic detection of hate speech in online social networks throughout COVID-19 with shallow and deep learning models. Tehnički vjesnik, 29(1): 149–156.
  43. Data expansion using back translation and paraphrasing for hate speech detection. Online Soc Networks Media 24.
  44. Climbing towards NLU: On meaning, form, and understanding in the age of data. In Proceedings of the 58th annual meeting of the association for computational linguistics, 5185–5198.
  45. Cyberbullying detection on social media using SVM. In Inventive Systems and Control: Proceedings of ICISC 2021, 17–27. Springer.
  46. Machine Learning Techniques for Hate Speech Detection on Social Media. In 2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT), 1–5. IEEE.
  47. Bilen, A. 2023. A Review: Detection of Discrimination and Hate Speech Shared on Social Media Platforms Using Artificial Intelligence Methods. Algorithmic Discrimination and Ethical Perspective of Artificial Intelligence, 171–181.
  48. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan): 993–1022.
  49. Bode, L. 2016. Political news in the news feed: Learning politics from social media. Mass communication and society, 19(1): 24–48.
  50. Cyberbullying detection on social media using machine learning. In IEEE INFOCOM 2023-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), 1–6. IEEE.
  51. Cyberbullying detection: Utilizing social media features. Expert Systems with Applications, 179: 115001.
  52. Brown, A. 2017a. What is hate speech? Part 1: The myth of hate. Law and Philosophy, 36: 419–468.
  53. Brown, A. 2017b. What is hate speech? Part 2: Family resemblances. Law and Philosophy, 36: 561–613.
  54. Automated cyberbullying detection in social media using an svm activated stacked convolution lstm network. In Proceedings of the 2020 4th International Conference on Compute and Data Analysis, 170–174.
  55. Bunde, E. 2021. AI-assisted and explainable hate speech detection for social media moderators–A design science approach.
  56. Explainable abuse detection as intent classification and slot filling. Transactions of the Association for Computational Linguistics, 10: 1440–1454.
  57. Cyberbullying: Definition, consequences, prevalence. In Reducing cyberbullying in schools, 3–16. Elsevier.
  58. I feel offended, don’t be abusive! implicit/explicit messages in offensive and abusive language. In Proceedings of the twelfth language resources and evaluation conference, 6193–6202.
  59. Graph embeddings for abusive language detection. SN Computer Science, 2: 1–15.
  60. Deep learning approaches for cyberbullying detection and classification on social media. Computational Intelligence and Neuroscience, 2022.
  61. The Internet’s hidden rules: An empirical study of Reddit norm violations at micro, meso, and macro scales. Proceedings of the ACM on Human-Computer Interaction, 2(CSCW): 1–25.
  62. Mean birds: Detecting aggression and bullying on twitter. In Proceedings of the 2017 ACM on web science conference, 13–22.
  63. Dynamic, incremental, and continuous detection of cyberbullying in online social media. ACM Transactions on the Web (TWEB), 15(3): 1–33.
  64. Harnessing the power of text mining for the detection of abusive content in social media. In Advances in Computational Intelligence Systems: Contributions Presented at the 16th UK Workshop on Computational Intelligence, September 7–9, 2016, Lancaster, UK, 187–205. Springer.
  65. A comparison of classical versus deep learning techniques for abusive content detection on social media sites. In Social Informatics: 10th International Conference, SocInfo 2018, St. Petersburg, Russia, September 25-28, 2018, Proceedings, Part I 10, 117–133. Springer.
  66. HENIN: Learning heterogeneous neural interaction networks for explainable cyberbullying detection on social media. arXiv preprint arXiv:2010.04576.
  67. PI-bully: Personalized cyberbullying detection with peer influence. In The 28th International Joint Conference on Artificial Intelligence (IJCAI).
  68. Xbully: Cyberbullying detection within a multi-modal context. In Proceedings of the twelfth acm international conference on web search and data mining, 339–347.
  69. Ciampaglia, G. L. 2018. Fighting fake news: a role for computational social science in the fight against digital misinformation. Journal of Computational Social Science, 1(1): 147–153.
  70. Social norms and the expression and suppression of prejudice: the struggle for internalization. Journal of personality and social psychology, 82(3): 359.
  71. Crump, D. 2009. What Does Intent Mean. Hofstra L. Rev., 38: 1059.
  72. Experts and machines against bullies: A hybrid approach to detect cyberbullies. In Advances in Artificial Intelligence: 27th Canadian Conference on Artificial Intelligence, Canadian AI 2014, Montréal, QC, Canada, May 6-9, 2014. Proceedings 27, 275–281. Springer.
  73. Improving cyberbullying detection with user context. In Advances in Information Retrieval: 35th European Conference on IR Research, ECIR 2013, Moscow, Russia, March 24-27, 2013. Proceedings 35, 693–696. Springer.
  74. Ensemble Learning with Tournament Selected Glowworm Swarm Optimization Algorithm for Cyberbullying Detection on Social Media. IEEE Access.
  75. Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages. arXiv preprint arXiv:2204.12543.
  76. Automated hate speech detection and the problem of offensive language. In Proceedings of the international AAAI conference on web and social media, volume 11, 512–515.
  77. Automatic offensive language detection from Twitter data using machine learning and feature selection of metadata. In 2020 international joint conference on neural networks (IJCNN), 1–6. IEEE.
  78. An Improved Detection of Cyberbullying on Social Media Using Randomized Sampling. International Journal of Bullying Prevention, 1–13.
  79. When the timeline meets the pipeline: A survey on automated cyberbullying detection. IEEE access, 9: 103541–103563.
  80. Peer to peer hate: Hate speech instigators and their targets. In Proceedings of the International AAAI Conference on Web and Social Media, volume 12.
  81. Early detection of deception and aggressiveness using profile-based representations. Expert Systems with Applications, 89: 99–111.
  82. A hybrid deep learning approach for abusive text detection. In AIP Conference Proceedings, volume 2753. AIP Publishing.
  83. Overview of the task on automatic misogyny identification at IberEval 2018. Ibereval@ sepln, 2150: 214–228.
  84. A survey on automatic detection of hate speech in text. ACM Computing Surveys (CSUR), 51(4): 1–30.
  85. Large scale crowdsourcing and characterization of twitter abusive behavior. In Proceedings of the international AAAI conference on web and social media, volume 12.
  86. A typology of disinformation intentionality and impact. Information Systems Journal.
  87. Friction-In-Design Regulation as 21st Century Time, Place, and Manner Restriction. Yale JL & Tech., 25: 376.
  88. Re-engineering humanity. Cambridge University Press.
  89. Hate speech detection: A comprehensive review of recent works. Expert Systems, e13562.
  90. A systematic bibliometric analysis of hate speech detection on social media sites. Journal of Scientometric Research, 11(1): 100–111.
  91. Detecting online hate speech using context aware models. arXiv preprint arXiv:1710.07395.
  92. Recognizing explicit and implicit hate speech using a weakly supervised two-path bootstrapping approach. arXiv preprint arXiv:1710.07394.
  93. Analysis and classification of abusive textual content detection in online social media. In Intelligent Communication Technologies and Virtual Mobile Networks: Proceedings of ICICV 2022, 173–190. Springer.
  94. Auto-Off ID: Automatic Detection of Offensive Language in Social Media. In Journal of Physics: Conference Series, volume 1911, 012012. IOP Publishing.
  95. A hashtag worth a thousand words: Discursive strategies around# JeNeSuisPasCharlie after the 2015 Charlie Hebdo shooting. Social Media+ Society, 3(1): 2056305116686992.
  96. A large labeled corpus for online harassment research. In Proceedings of the 2017 ACM on web science conference, 229–233.
  97. Exploring hate speech detection in multimodal publications. In Proceedings of the IEEE/CVF winter conference on applications of computer vision, 1470–1478.
  98. Feature Representation Techniques for Hate Speech Detection on Social Media: A Comparative Study. In 2022 International Conference on Signal and Information Processing (IConSIP), 1–6. IEEE.
  99. A survey of explainable AI techniques for detection of fake news and hate speech on social media platforms. Journal of Computational Social Science, 1–37.
  100. Experimental Evaluation of Robust Cyberbullying Detection over social media using Intelligent Learning Scheme. In 2023 International Conference on Research Methodologies in Knowledge Management, Artificial Intelligence and Telecommunication Engineering (RMKMATE), 1–7. IEEE.
  101. Hate towards the political opponent: A Twitter corpus study of the 2020 US elections on the basis of offensive speech and stance detection. arXiv preprint arXiv:2103.01664.
  102. A Survey on Deep Learning Models to Detect Hate Speech and Bullying in Social Media. In Artificial Intelligence for Societal Issues, 27–44. Springer.
  103. Social Media Hate Speech Detection Using Machine Learning Approach. In International Conference on Computational Intelligence in Data Science, 218–229. Springer.
  104. nlpUP at SemEval-2020 Task 12: A blazing fast system for offensive language detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2098–2104.
  105. Whom to trust? Media exposure patterns of citizens with perceptions of misinformation and disinformation related to the news media. European journal of communication, 37(3): 237–268.
  106. Social media cyberbullying detection using machine learning. International Journal of Advanced Computer Science and Applications, 10(5): 703–707.
  107. Hanu, L.; and Unitary team. 2020. Detoxify. Github. https://github.com/unitaryai/detoxify.
  108. Automatic Detection of Cyberbullying on Social Media Using Machine Learning. In 2023 2nd International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), 1–6. IEEE.
  109. Hashemi, M. 2021. A data-driven framework for coding the intent and extent of political tweeting, disinformation, and extremism. Information, 12(4): 148.
  110. Racism is a virus: Anti-Asian hate and counterspeech in social media during the COVID-19 crisis. In Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 90–94.
  111. AdelaideCyC at SemEval-2020 task 12: Ensemble of classifiers for offensive language detection in social media. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, 1516–1523.
  112. Cyber bullying detection using social and textual analysis. In Proceedings of the 3rd International Workshop on Socially-aware Multimedia, 3–6.
  113. A multitask learning framework for abuse detection and emotion classification. Algorithms, 15(4): 116.
  114. Cyberbullying Detection on Social Media: A Brief Survey. In 2023 Second International Conference on Advanced Computer Applications (ACA), 1–6. IEEE.
  115. AlexU-BackTranslation-TL at SemEval-2020 task 12: Improving offensive language detection using data augmentation and transfer learning. In Proceedings of the fourteenth workshop on semantic evaluation, 1881–1890.
  116. Women’s Perspectives on Harm and Justice after Online Harassment. Proceedings of the ACM on Human-Computer Interaction, 6(CSCW2): 1–23.
  117. Exploring definition of cyberbullying and its forms from the perspective of adolescents living in Pakistan. Psychological studies, 67(4): 514–523.
  118. Detection of Cyberbullying in Social Media Texts Using Explainable Artificial Intelligence. In International Conference on Ubiquitous Security, 319–334. Springer.
  119. Racist and sexist hate speech detection: Literature review. In 2020 international conference on intelligent data science technologies and applications (IDSTA), 95–99. IEEE.
  120. Cyberbullying detection solutions based on deep learning architectures. Multimedia Systems, 29(3): 1839–1852.
  121. Data expansion using wordnet-based semantic expansion and word disambiguation for cyberbullying detection. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, 1761–1770.
  122. A systematic review of Hate Speech automatic detection using Natural Language Processing. Neurocomputing, 126232.
  123. Detection of cyberbullying on social media using machine learning. In 2021 5th international conference on computing methodologies and communication (ICCMC), 1091–1096. IEEE.
  124. When does a compliment become sexist? analysis and classification of ambivalent sexism using twitter data. In Proceedings of the second workshop on NLP and computational social science, 7–16.
  125. Characterizing community guidelines on social media platforms. In Companion Publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing, 287–291.
  126. Attention-based method for categorizing different types of online harassment language. In Machine Learning and Knowledge Discovery in Databases: International Workshops of ECML PKDD 2019, Würzburg, Germany, September 16–20, 2019, Proceedings, Part II, 321–330. Springer.
  127. Abusive content detection in online user-generated data: a survey. Procedia Computer Science, 189: 274–281.
  128. A context aware embedding for the detection of hate speech in social media networks. In 2021 International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON), 1–4. IEEE.
  129. Smart Language Checker: A Machine Learning Solution for Offensive Language detection in Social Media. In 2023 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI), 1–6. IEEE.
  130. Offensive Language Detection on Online Social Networks using Hybrid Deep Learning Architecture. International Journal of Advanced Computer Science & Applications, 14(11).
  131. Online hate and harmful content: Cross-national perspectives. Taylor & Francis.
  132. Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale. Language Resources and Evaluation, 1–30.
  133. Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application. arXiv preprint arXiv:2009.10277.
  134. Characterization and mechanical properties of offensive language taxonomy and detection techniques. Materials Today: Proceedings, 81: 630–633.
  135. Explainable Offensive Language Classifier. In International Conference on Neural Information Processing, 299–313. Springer.
  136. Challenges of hate speech detection in social media: Data scarcity, and leveraging external resources. SN Computer Science, 2(2): 95.
  137. Leveraging external resources for offensive content detection in social media. AI Communications, 35(2): 87–109.
  138. Hate Speech Detection in Multi-social Media Using Deep Learning. In International Conference on Advanced Communication and Intelligent Systems, 59–70. Springer.
  139. A Bi-GRU with attention and CapsNet hybrid model for cyberbullying detection on social media. World Wide Web, 25(4): 1537–1550.
  140. Deep learning for hate speech detection in social media. In 2021 IEEE 4th International Conference on Computing, Power and Communication Technologies (GUCON), 1–4. IEEE.
  141. Watch your language: large language models and content moderation. arXiv preprint arXiv:2309.14517.
  142. A study of machine learning-based models for detection, control, and mitigation of cyberbullying in online social media. International Journal of Information Security, 21(6): 1409–1431.
  143. News sharing in social media: A review of current research on news sharing users, content, and networks. Social media+ society, 1(2): 2056305115610141.
  144. Enhancing Hate Speech Detection for Social Media Moderation: A Comparative Analysis of Machine Learning Algorithms. In 2023 International Conference on Advanced Mechatronics, Intelligent Manufacture and Industrial Automation (ICAMIMIA), 960–964. IEEE.
  145. What is hate speech? The case for a corpus approach. Criminal Law and Philosophy, 1–34.
  146. An integrated explicit and implicit offensive language taxonomy. Lodz Papers in Pragmatics, 19(1): 7–48.
  147. Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors. arXiv preprint arXiv:2403.09747.
  148. Combining Pre-Trained Language Models and Features for Offensive Language Detection. In 2022 13th International Congress on Advanced Applied Informatics Winter (IIAI-AAI-Winter), 5–10. IEEE.
  149. NULI at SemEval-2019 task 6: Transfer learning for offensive language detection using bidirectional transformers. In Proceedings of the 13th international workshop on semantic evaluation, 87–91.
  150. Non-linguistic features for cyberbullying detection on a social media platform using machine learning. In Cyberspace Safety and Security: 11th International Symposium, CSS 2019, Guangzhou, China, December 1–3, 2019, Proceedings, Part I 11, 391–406. Springer.
  151. Site agnostic approach to early detection of cyberbullying on social media networks. Sensors, 23(10): 4788.
  152. Early detection of cyberbullying on social media networks. Future Generation Computer Systems, 118: 219–229.
  153. Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurrency and Computation: Practice and Experience, 32(23): e5627.
  154. Hate speech detection: Challenges and solutions. PloS one, 14(8): e0221152.
  155. Toxic speech detection using traditional machine learning models and bert and fasttext embedding with deep neural networks. In 2021 5th International Conference on Computing Methodologies and Communication (ICCMC), 1254–1259. IEEE.
  156. Overview of the hasoc track at fire 2019: Hate speech and offensive content identification in indo-european languages. In Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation, 14–17.
  157. Twitter hate speech detection: a systematic review of methods, taxonomy analysis, challenges, and opportunities. IEEE Access, 11: 16226–16249.
  158. The role of context in detecting the target of hate speech. In Proceedings of the Third Workshop on Threat, Aggression and Cyberbullying (TRAC 2022), October, Gyeongju, Republic of Korea, 37–42.
  159. Media manipulation and disinformation online.
  160. HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection. arXiv preprint arXiv:2012.10289.
  161. Hatexplain: A benchmark dataset for explainable hate speech detection. In Proceedings of the AAAI conference on artificial intelligence, volume 35, 14867–14875.
  162. Analysis of Tweets for Cyberbullying Detection. In 2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC), 269–274. IEEE.
  163. Social media hate speech detection using explainable artificial intelligence (XAI). Algorithms, 15(8): 291.
  164. Abuse is contextual, what about nlp? the role of context in abusive language annotation and detection. arXiv preprint arXiv:2103.14916.
  165. Hate speech and offensive language detection from social media. In 2021 International Conference on Computing, Electronic and Electrical Engineering (ICE Cube), 1–5. IEEE.
  166. Hate Speech Detection in Social Media (Twitter) Using Neural Network. J. Mobile Multimedia, 19(3): 765–798.
  167. Cyber hate speech on twitter: Analyzing disruptive events from social media to build a violent communication and hate speech taxonomy. International Journal of Design & Nature and Ecodynamics, 11(3): 406–415.
  168. Tackling online abuse: A survey of automated abuse detection methods. arXiv preprint arXiv:1908.06024.
  169. Cyber-aggression, cyberbullying, and cyber-grooming: A survey and research challenges. ACM Computing Surveys (CSUR), 54(1): 1–42.
  170. Modi, S. 2018. AHTDT-Automatic Hate Text Detection Techniques in Social Media. In 2018 International Conference on Circuits and Systems in Digital Enterprise Technology (ICCSDET), 1–3. IEEE.
  171. On the importance of word embedding in automated harmful information detection. In International Conference on Text, Speech, and Dialogue, 251–262. Springer.
  172. “Fake news” is not simply false information: A concept explication and taxonomy of online content. American behavioral scientist, 65(2): 180–212.
  173. A BERT-based transfer learning approach for hate speech detection in online social media. In Complex Networks and Their Applications VIII: Volume 1 Proceedings of the Eighth International Conference on Complex Networks and Their Applications COMPLEX NETWORKS 2019 8, 928–940. Springer.
  174. Hate speech detection and racial bias mitigation in social media based on BERT model. PloS one, 15(8): e0237861.
  175. Advances in machine learning algorithms for hate speech detection in social media: a review. IEEE Access, 9: 88364–88376.
  176. Cyberbullying detection on social media using stacking ensemble learning and enhanced BERT. Information, 14(8): 467.
  177. DEA-RNN: A hybrid deep learning approach for cyberbullying detection in Twitter social media platform. IEEE Access, 10: 25857–25871.
  178. FAEO-ECNN: cyberbullying detection in social media platforms using topic modelling and deep learning. Multimedia Tools and Applications, 82(30): 46611–46650.
  179. Classification of Hate Speech Language Detection on Social Media: Preliminary Study for Improvement. In International Conference on Networking, Intelligent Systems and Security, 146–156. Springer.
  180. Hate speech detection on social media using graph convolutional networks. In Complex Networks & Their Applications X: Volume 2, Proceedings of the Tenth International Conference on Complex Networks and Their Applications COMPLEX NETWORKS 2021 10, 3–14. Springer.
  181. Unintended bias evaluation: An analysis of hate speech detection and gender bias mitigation on social media using ensemble learning. Expert Systems with Applications, 201: 117032.
  182. An Efficient Deep Learning-Based Hybrid Architecture for Hate Speech Detection in Social Media. In Data Science and Security: Proceedings of IDSCS 2022, 347–355. Springer.
  183. Nielsen, L. B. 2002. Subtle, pervasive, harmful: Racist and sexist remarks in public as hate speech. Journal of Social issues, 58(2): 265–280.
  184. Detection and classification of cyberbullying in social media using text mining. In 2022 6th International Conference on Electronics, Communication and Aerospace Technology, 856–861. IEEE.
  185. ProTect: a hybrid deep learning model for proactive detection of cyberbullying on social media. Frontiers in artificial intelligence, 7: 1269366.
  186. Cyberbullying: Labels, behaviours and definition in three European countries. Australian Journal of Guidance and Counselling, 20(2): 129–142.
  187. The effect of extremist violence on hateful speech online. In Proceedings of the international AAAI conference on web and social media, volume 12.
  188. A comparative analysis of machine learning algorithms for hate speech detection in social media. Online Journal of Communication and Media Technologies, 13(4): e202348.
  189. Multilingual and multi-aspect hate speech analysis. arXiv preprint arXiv:1908.11049.
  190. Securing Social Spaces: Cyberbullying Detection with ML and DL on Social Media Platforms. In 2023 International Conference on Sustainable Communication Networks and Application (ICSCNA), 1471–1476. IEEE.
  191. Hate speech detection in twitter using natural language processing. In 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), 1146–1152. IEEE.
  192. ConvAI at SemEval-2019 task 6: Offensive language identification and categorization with Perspective and BERT. In Proceedings of the 13th International Workshop on Semantic Evaluation, 571–576.
  193. Hate speech: A systematized review. Sage Open, 10(4): 2158244020973022.
  194. Accurate cyberbullying detection and prevention on social media. Procedia Computer Science, 181: 605–611.
  195. Self-attention for cyberbullying detection. In 2020 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA), 1–6. IEEE.
  196. Offensive Language Detection in Social Media Using Ensemble Techniques. In 2023 International Conference on Circuit Power and Computing Technologies (ICCPCT), 805–808. IEEE.
  197. A benchmark dataset for learning to intervene in online hate speech. arXiv preprint arXiv:1909.04251.
  198. Improved Hierarchical Attention Networks for Cyberbullying Detection via Social Media Data. In 2023 IEEE International Conference on Networking, Sensing and Control (ICNSC), volume 1, 1–6. IEEE.
  199. Investigating user information and social media features in cyberbullying detection. In 2022 IEEE International Conference on Big Data (Big Data), 3063–3070. IEEE.
  200. Un-compromised credibility: Social media based multi-class hate speech classification for text. IEEE Access, 9: 109465–109477.
  201. Ramiandrisoa, F. 2022. Multi-task Learning for Hate Speech and Aggression Detection. In CIRCLE.
  202. Hate speech detection in social media: Techniques, recent trends, and future challenges. Wiley Interdisciplinary Reviews: Computational Statistics, 16(2): e1648.
  203. Balancing techniques for improving automated detection of hate speech and offensive language on social media. In 2023 2nd International Conference for Innovation in Technology (INOCON), 1–8. IEEE.
  204. A quality type-aware annotated corpus and lexicon for harassment research. In Proceedings of the 10th ACM Conference on Web Science, 33–36.
  205. Characterizing and detecting hateful users on twitter. In Proceedings of the International AAAI Conference on Web and Social Media, volume 12.
  206. Measuring the reliability of hate speech annotations: The case of the european refugee crisis. arXiv preprint arXiv:1701.08118.
  207. Text based hate-speech analysis. In 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), 661–668. IEEE.
  208. Hatemonitors: Language agnostic abuse detection in social media. arXiv preprint arXiv:1909.12642.
  209. A Comparative Analysis of Machine Learning Techniques for Cyberbullying Detection on FormSpring in Textual Modality.
  210. An approach to detect cyberbullying on social media. In Model and Data Engineering: 10th International Conference, MEDI 2021, Tallinn, Estonia, June 21–23, 2021, Proceedings 10, 53–66. Springer.
  211. A large-scale English multi-label Twitter dataset for cyberbullying and online abuse detection. In The 5th Workshop on Online Abuse and Harms, 146–156. Association for Computational Linguistics.
  212. Abusive Language Detection on Social Media using Bidirectional Long-Short Term Memory. In 2022 IEEE 26th International Conference on Intelligent Engineering Systems (INES), 000243–000248. IEEE.
  213. Anatomy of online hate: developing a taxonomy and machine learning models for identifying and classifying hate in online news media. In Proceedings of the International AAAI Conference on Web and Social Media, volume 12.
  214. “Call me sexist, but…”: Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples. In Proceedings of the International AAAI Conference on Web and Social Media, volume 15, 573–584.
  215. Ensemble Text Classification with TF-IDF Vectorization for Hate Speech Detection in Social Media. In 2023 International Conference on System, Computation, Automation and Networking (ICSCAN), 1–7. IEEE.
  216. A framework of severity for harmful content online. Proceedings of the ACM on Human-Computer Interaction, 5(CSCW2): 1–33.
  217. A survey on detection of cyberbullying in social media using machine learning techniques. In Intelligent Communication Technologies and Virtual Mobile Networks: Proceedings of ICICV 2022, 323–340. Springer.
  218. Cyberbullying Detection in Social Media Using Supervised ML and NLP Techniques. In Communication and Intelligent Systems: Proceedings of ICCIS 2021, 817–828. Springer.
  219. Nlp-cuet@ dravidianlangtech-eacl2021: Offensive language detection from multilingual code-mixed text using transformers. arXiv preprint arXiv:2103.00455.
  220. Automatic and Advance Techniques for Hate Speech Detection on Social Media: A Review. In 2022 Algorithms, Computing and Mathematics Conference (ACM), 54–61. IEEE.
  221. An Exploration of Machine Learning and Deep Learning Techniques for Offensive Text Detection in Social Media—A Systematic Review. In International Conference on Innovative Computing and Communications: Proceedings of ICICC 2022, Volume 3, 541–559. Springer.
  222. Deep Learning Approach for Hate and Non Hate Speech Detection in Online Social Media. In 2023 3rd International Conference on Technological Advancements in Computational Sciences (ICTACS), 492–496. IEEE.
  223. Shearer, E. 2018. Social media outpaces print newspapers in the US as a news source.
  224. Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation. arXiv preprint arXiv:2405.03085.
  225. Deep learning based methods for cyberbullying detection on social media. In 2022 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS), 521–525. IEEE.
  226. An efficient automated multi-modal cyberbullying detection using decision fusion classifier on social media platforms. Multimedia Tools and Applications, 83(7): 20507–20535.
  227. NLP Based Hate Speech Detection And Moderation. In 2023 7th International Conference on Computation System and Information Technology for Sustainable Solutions (CSITSS), 1–5. IEEE.
  228. Toward multimodal cyberbullying detection. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, 2090–2099.
  229. Cyberbullying detection using probabilistic socio-textual information fusion. In 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 884–887. IEEE.
  230. Combatting online harassment by using transformer language models for the detection of emotions, hate speech and offensive language on social media. In 2022 4th International Conference on Emerging Trends in Electrical, Electronic and Communications Engineering (ELECOM), 1–6. IEEE.
  231. A novel multimodal hybrid classifier based cyberbullying detection for social media platform. In Proceedings of the Computational Methods in Systems and Software, 689–699. Springer.
  232. Hybrid CNN-LSTM Network for Cyberbullying Detection on Social Networks using Textual Contents. International Journal of Advanced Computer Science and Applications, 14(9).
  233. Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text. In Proceedings of the second workshop on trolling, aggression and cyberbullying, 32–41.
  234. Cyberbullying detection based on word curve representations using B-spline interpolation. In Proceedings of the 4th International Conference on Future Networks and Distributed Systems, 1–7.
  235. Upb at semeval-2020 task 12: Multilingual offensive language detection on social media by fine-tuning a variety of bert-based models. arXiv preprint arXiv:2010.13609.
  236. A multi-modal dataset for hate speech detection on social media: Case-study of russia-ukraine conflict. In CASE 2022-5th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, Proceedings of the Workshop. Association for Computational Linguistics.
  237. A study of text representations for Hate Speech Detection. In International Conference on Computational Linguistics and Intelligent Text Processing, 424–437. Springer.
  238. Ssn_nlp at SemEval 2020 Task 12: Offense Target Identification in Social Media Using Traditional and Deep Machine Learning Approaches. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2155–2160.
  239. SSN_NLP at SemEval-2019 task 6: Offensive language identification in social media using traditional and deep machine learning approaches. In Proceedings of the 13th International Workshop on Semantic Evaluation, 739–744.
  240. Large-scale hate speech detection with cross-domain transfer. arXiv preprint arXiv:2203.01111.
  241. A multi-platform dataset for detecting cyberbullying in social media. Language Resources and Evaluation, 54(4): 851–874.
  242. Automatic detection of cyberbullying in social media text. PLOS ONE, 13(10): e0203794.
  243. Detecting East Asian prejudice on social media. arXiv preprint arXiv:2005.03909.
  244. Directions in abusive language training data, a systematic review: Garbage in, garbage out. PLOS ONE, 15(12): e0243300.
  245. Challenges and frontiers in abusive content detection. In Proceedings of the third workshop on abusive language online. Association for Computational Linguistics.
  246. Introducing CAD: the contextual abuse dataset.
  247. Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection. In ACL.
  248. Multimodal Cyberbullying Detection on Social Media: Review and Challenges. In 2023 International Conference on Integration of Computational Intelligent System (ICICIS), 1–8. IEEE.
  249. Detect all abuse! toward universal abusive language detection models. arXiv preprint arXiv:2010.03776.
  250. Multi-modal cyberbullying detection on social networks. In 2020 International Joint Conference on Neural Networks (IJCNN), 1–8. IEEE.
  251. Evidence of inter-state coordination amongst state-backed information operations. Scientific Reports, 13(1): 7716.
  252. From Yellow Peril to Model Minority: Asian stereotypes in social media during the COVID-19 pandemic. In Proceedings of the 15th ACM Web Science Conference 2023, 283–291.
  253. Waseem, Z. 2016. Are you a racist or am I seeing things? Annotator influence on hate speech detection on Twitter. In Proceedings of the first workshop on NLP and computational social science, 138–142.
  254. Understanding abuse: A typology of abusive language detection subtasks. arXiv preprint arXiv:1705.09899.
  255. Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In Proceedings of the NAACL student research workshop, 88–93.
  256. Social media as information source: Recency of updates and credibility of information. Journal of Computer-Mediated Communication, 19(2): 171–183.
  257. Detection of abusive language: the problem of biased datasets. In Proceedings of the 2019 conference of the North American Chapter of the Association for Computational Linguistics: human language technologies, volume 1 (long and short papers), 602–608.
  258. Alone: A dataset for toxic behavior among adolescents on twitter. In Social Informatics: 12th International Conference, SocInfo 2020, Pisa, Italy, October 6–9, 2020, Proceedings 12, 427–439. Springer.
  259. Wright, M. F. 2019. Cyberbullying: Definition, Description, Characteristics, and Consequences. In Handbook of Research on Children’s Consumption of Digital Media, 223–234. IGI Global.
  260. Wright, M. F. 2021. Cyberbullying: Definition, behaviors, correlates, and adjustment problems. In Encyclopedia of Information Science and Technology, Fifth Edition, 356–373. IGI Global.
  261. ToxCCIn: Toxic content classification with interpretability. arXiv preprint arXiv:2103.01328.
  262. Potential cyberbullying detection in social media platforms based on a multi-task learning framework. International Journal of Data and Network Science, 8(1): 25–34.
  263. Yao, M. 2019. Robust detection of cyberbullying in social media. In Companion Proceedings of The 2019 World Wide Web Conference, 61–66.
  264. Cyberbullying ends here: Towards robust detection of cyberbullying in social media. In The World Wide Web Conference, 3427–3433.
  265. Learning like human annotators: Cyberbullying detection in lengthy social media sessions. In Proceedings of the ACM Web Conference 2023, 4095–4103.
  266. Session-based cyberbullying detection in social media: A survey. Online Social Networks and Media, 36: 100250.
  267. Towards generalisable hate speech detection: a review on obstacles and solutions. PeerJ Computer Science, 7: e598.
  268. Transfer learning for hate speech detection in social media. Journal of Computational Social Science, 6(2): 1081–1101.
  269. Predicting the type and target of offensive posts in social media. arXiv preprint arXiv:1902.09666.
  270. Multiword expression features for automatic hate speech detection. In International Conference on Applications of Natural Language to Information Systems, 156–164. Springer.
  271. A Taxonomy of Rater Disagreements: Surveying Challenges & Opportunities from the Perspective of Annotating Online Toxicity. arXiv preprint arXiv:2311.04345.
  272. Aggressive, repetitive, intentional, visible, and imbalanced: Refining representations for cyberbullying classification. In Proceedings of the International AAAI Conference on Web and Social Media, volume 14, 808–819.
  273. Fake news: understanding media and misinformation in the digital age. MIT Press.
