Skip to main navigation Skip to search Skip to main content

Automated Categorization and Analysis of Kannada News Content Using Transformers

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Online news consumption has increased dramatically in the modern era, making effective classification methods for news content necessary. The process of systematically classifying news items so that readers can get pertinent information without needless effort is known as news categorisation. An efficient classification system is necessary to enhance accessibility and user experience because many sources publish large volumes of news stories every day. Using a dataset of 700 articles in seven different categories, this study concentrates on automated news category classification. We created our own dataset for particular categories, such as politics, crime, business, and technology, in order to overcome the problem of inconsistent data. We developed a supervised machine learning model to classify articles into business, entertainment, politics, sports, technology, lifestyle, and crime using the pre-trained XLMRoBERTa model. By screening articles according to user interests, this method optimises user engagement and allows for personalised content distribution. With an F1-score of 60.06% and a classification accuracy of 65.71%, our model proved effective in managing the multilingual and context-sensitive data found in online news.

Original languageEnglish
Title of host publicationProceedings - International Conference on Next Generation Communication and Information Processing, INCIP 2025
EditorsMahipal Bukya, Pramod Kumar, Sanyog Rawat, Mahesh Jangid
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages719-722
Number of pages4
ISBN (Electronic)9798331528140
DOIs
Publication statusPublished - 2025
Event2025 International Conference on Next Generation Communication and Information Processing, INCIP 2025 - Bangalore, India
Duration: 23-01-202524-01-2025

Publication series

NameProceedings - International Conference on Next Generation Communication and Information Processing, INCIP 2025

Conference

Conference2025 International Conference on Next Generation Communication and Information Processing, INCIP 2025
Country/TerritoryIndia
CityBangalore
Period23-01-2524-01-25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 16 - Peace, Justice and Strong Institutions
    SDG 16 Peace, Justice and Strong Institutions

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Energy Engineering and Power Technology
  • Electronic, Optical and Magnetic Materials
  • Computer Networks and Communications
  • Computer Science Applications
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Automated Categorization and Analysis of Kannada News Content Using Transformers'. Together they form a unique fingerprint.

Cite this