Skip to main navigation Skip to search Skip to main content

Fighting hate speech from bilingual hinglish speaker’s perspective, a transformer- and translation-based approach.

  • Shankar Biradar*
  • , Sunil Saumya
  • , Arun chauhan
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Many people have begun to use social media platforms due to the increased use of the Internet over the previous decade. It has a lot of benefits, but it also comes with a lot of risks and drawbacks, such as Hate speech. People in multilingual societies, such as India, frequently mix their native language with English while speaking, so detecting hate content in such bilingual code-mixed data has drawn the larger interest of the research community. The majority of previous work focuses on high-resource language such as English, but very few researchers have concentrated on the mixed bilingual data like Hinglish. In this study, we investigated the performance of transformer models like IndicBERT and multilingual Bidirectional Encoder Representation(mBERT), as well as transfer learning from pre-trained language models like ULMFiT and Bidirectional encoder Representation(BERT), to find hateful content in Hinglish. Also, Transformer-based Interpreter and Feature extraction model on Deep Neural Network (TIF-DNN), is proposed in this work. The experimental results found that our proposed model outperforms existing state-of-art methods for Hate speech identification in Hinglish language with an accuracy of 73%.

Original languageEnglish
Article number87
JournalSocial Network Analysis and Mining
Volume12
Issue number1
DOIs
Publication statusPublished - 12-2022

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Communication
  • Media Technology
  • Human-Computer Interaction
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Fighting hate speech from bilingual hinglish speaker’s perspective, a transformer- and translation-based approach.'. Together they form a unique fingerprint.

Cite this