Abusive Speech Detection in Indic Languages Using Acoustic Features (2407.20808v1)

Published 30 Jul 2024 in cs.SD and eess.AS

Abstract: Abusive content in online social networks is a well-known problem that can cause serious psychological harm and incite hatred. The ability to upload audio data increases the importance of developing methods to detect abusive content in speech recordings. However, simply transferring the mechanisms from written abuse detection would ignore relevant information such as emotion and tone. In addition, many current algorithms require training in the specific language for which they are being used. This paper proposes to use acoustic and prosodic features to classify abusive content. We used the ADIMA data set, which contains recordings from ten Indic languages, and trained different models in multilingual and cross-lingual settings. Our results show that it is possible to classify abusive and non-abusive content using only acoustic and prosodic features. The most important and influential features are discussed.
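The cross-lingual setting mentioned in the abstract can be pictured as a leave-one-language-out split: train on every language but one, then test on the held-out language. The sketch below is a minimal illustration only, not the authors' protocol, which the abstract does not specify. The function name `leave_one_language_out`, the `(language, features, label)` tuple layout, the placeholder language tags, and the toy feature values are all assumptions made for this example.

```python
from collections import defaultdict


def leave_one_language_out(samples):
    """Yield (held_out_language, train, test) splits.

    `samples` is a list of (language, features, label) tuples.
    This layout is hypothetical and chosen only for illustration.
    """
    # Group the samples by their language tag.
    by_lang = defaultdict(list)
    for sample in samples:
        by_lang[sample[0]].append(sample)

    # Hold out each language once; train on all remaining languages.
    for held_out in sorted(by_lang):
        test = by_lang[held_out]
        train = [
            s
            for lang in sorted(by_lang)
            if lang != held_out
            for s in by_lang[lang]
        ]
        yield held_out, train, test


# Toy data: placeholder language tags, made-up feature vectors,
# and binary labels (1 = abusive, 0 = non-abusive).
toy = [
    ("lang_a", [0.1, 0.2], 1),
    ("lang_a", [0.3, 0.1], 0),
    ("lang_b", [0.5, 0.4], 1),
    ("lang_c", [0.2, 0.9], 0),
]

splits = list(leave_one_language_out(toy))
for held_out, train, test in splits:
    print(held_out, len(train), len(test))
```

A multilingual setting would instead pool all languages and split within the pooled set, so that every language appears in both training and test data.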

Authors (4)

Anika A. Spiesberger
Andreas Triantafyllopoulos
Iosif Tsangko
Björn W. Schuller
