IIITDWD@TamilNLP-ACL2022: Transformer-based approach to classify abusive content in Dravidian Code-mixed text

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Identifying abusive content or hate speech in social media text has attracted growing interest from the research community in recent times. The major driving force behind this is the widespread use of social media websites. This interest also extends to identifying abusive content in low-resource regional languages, which is an important research problem in computational linguistics. As part of ACL-2022, the organizers of DravidianLangTech@ACL 2022 released a shared task on abusive category identification in Tamil and Tamil-English code-mixed text to encourage further research on offensive content identification in low-resource Indic languages. This paper presents the working notes for the model submitted by IIITDWD at DravidianLangTech@ACL 2022. Our team competed in Sub-Task B and finished in 9th place among the participating teams. In our proposed approach, we used a pre-trained transformer model such as Indic-bert for feature extraction, and on top of that, an SVM classifier is used for stance detection. Our model achieved 62% accuracy on code-mixed Tamil-English text.
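The two-stage design described in the abstract (a fixed feature extractor followed by an SVM) can be illustrated with a minimal, dependency-free sketch. Everything in it is an assumption made for illustration, not the authors' implementation: `hashed_ngram_embed` is a hypothetical stand-in for the Indic-bert encoder so the example runs without downloading a model, `LinearSVM` is a small binary hinge-loss classifier trained Pegasos-style rather than a library SVM, and the example texts and labels are invented placeholders. The shared task itself is multi-class, whereas this sketch is binary (labels +1 and -1).

```python
import hashlib
import random
from typing import Callable, List, Sequence

Vector = List[float]


def hashed_ngram_embed(text: str, dim: int = 64, n: int = 3) -> Vector:
    """Hypothetical stand-in for a frozen pre-trained encoder.

    Returns an L2-normalised vector of hashed character n-gram counts.
    """
    vec = [0.0] * dim
    padded = f" {text.lower()} "
    for i in range(len(padded) - n + 1):
        digest = hashlib.md5(padded[i:i + n].encode("utf-8")).hexdigest()
        vec[int(digest, 16) % dim] += 1.0
    norm = sum(v * v for v in vec) ** 0.5
    return [v / norm for v in vec] if norm > 0 else vec


class LinearSVM:
    """Binary linear SVM (no bias term) trained by Pegasos-style
    stochastic sub-gradient descent on the regularised hinge loss."""

    def __init__(self, dim: int, lam: float = 0.01, epochs: int = 50, seed: int = 0):
        self.w = [0.0] * dim
        self.lam = lam
        self.epochs = epochs
        self.rng = random.Random(seed)

    def decision(self, x: Vector) -> float:
        return sum(wi * xi for wi, xi in zip(self.w, x))

    def fit(self, xs: Sequence[Vector], ys: Sequence[int]) -> "LinearSVM":
        order = list(range(len(xs)))
        t = 0
        for _ in range(self.epochs):
            self.rng.shuffle(order)
            for i in order:
                t += 1
                eta = 1.0 / (self.lam * t)  # decaying step size
                margin = ys[i] * self.decision(xs[i])
                # Shrink the weights (gradient of the L2 regulariser).
                self.w = [(1.0 - eta * self.lam) * wi for wi in self.w]
                # Add the hinge-loss sub-gradient only for margin violations.
                if margin < 1.0:
                    self.w = [wi + eta * ys[i] * xi
                              for wi, xi in zip(self.w, xs[i])]
        return self

    def predict(self, x: Vector) -> int:
        return 1 if self.decision(x) >= 0.0 else -1


def train_classifier(
    texts: Sequence[str],
    labels: Sequence[int],
    embed: Callable[[str], Vector] = hashed_ngram_embed,
) -> LinearSVM:
    """Stage 1: embed each text with a fixed extractor. Stage 2: fit the SVM."""
    feats = [embed(t) for t in texts]
    return LinearSVM(dim=len(feats[0])).fit(feats, labels)


if __name__ == "__main__":
    # Invented placeholder data: +1 = abusive, -1 = not abusive.
    texts = ["you are awful", "awful awful person",
             "have a nice day", "nice to meet you"]
    labels = [1, 1, -1, -1]
    model = train_classifier(texts, labels)
    print([model.predict(hashed_ngram_embed(t)) for t in texts])
```

Because the extractor is passed in as the `embed` argument, a function that returns pooled transformer embeddings could be substituted for the stand-in without changing the classifier stage.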

Original language: English
Title of host publication: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop
Editors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Parameswari Krishnamurthy, Elizabeth Sherly, Sinnathamby Mahesan
Publisher: Association for Computational Linguistics (ACL)
Pages: 100-104
Number of pages: 5
ISBN (Electronic): 9781955917346
Publication status: Published - 2022
Event: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022 - Dublin, Ireland
Duration: 26-05-2022 → …

Publication series

Name: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop

Conference

Conference: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022
Country/Territory: Ireland
City: Dublin
Period: 26-05-22 → …

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language
