IIITDWD_SVC@DravidianLangTech-2024: Breaking Language Barriers; Hate Speech Detection in Telugu-English Code-Mixed Text

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Social media platforms have become increasingly popular and are utilized for a wide range of purposes, including product promotion, news sharing, accomplishment sharing, and much more. However, it is also employed for defamatory speech, intimidation, and the propagation of untruths about particular groups of people. Further, hateful and offensive posts spread quickly and often have a negative impact on people; it is important to identify and remove them from social media platforms as soon as possible. Over the past few years, research on hate speech detection and offensive content has grown in popularity. One of the many difficulties in identifying hate speech on social media platforms is the use of code-mixed language. The majority of people who use social media typically share their messages in languages with mixed codes, like Telugu–English. To encourage research in this direction, the organizers of DravidianLangTech@EACL-2024 conducted a shared task to identify hateful content in Telugu-English code-mixed text. Our team participated in this shared task, employing three different models: Xlm-Roberta, BERT, and Hate-BERT. In particular, our BERT-based model secured the 14th rank in the competition with a macro F1 score of 0.65.

Original languageEnglish
Title of host publicationDravidianLangTech 2024 - 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, Proceedings of the Workshop
EditorsBharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Sajeetha Thavareesan, Elizabeth Sherly, Rajeswari Nadarajan, Manikandan Ravikiran
PublisherAssociation for Computational Linguistics (ACL)
Pages119-123
Number of pages5
ISBN (Electronic)9798891760783
Publication statusPublished - 2024
Event4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024 - Hybrid, St. Julian's, Malta
Duration: 22-03-2024 → …

Publication series

NameDravidianLangTech 2024 - 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, Proceedings of the Workshop

Conference

Conference4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024
Country/TerritoryMalta
CityHybrid, St. Julian's
Period22-03-24 → …

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Computer Science (miscellaneous)
  • Computational Mathematics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'IIITDWD_SVC@DravidianLangTech-2024: Breaking Language Barriers; Hate Speech Detection in Telugu-English Code-Mixed Text'. Together they form a unique fingerprint.

Cite this