KNN-based Speech-to-Text Conversion for Bangla to Enhance Regional Language Processing

  • Aditi Nayak*
  • Jasmita Mukherjee
  • Deepak Parashar
  • Nilesh Bahadure
  • Kshem Dikshit
  • Rahul Joshi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Speech-to-text conversion for regional languages is a crucial first step toward closing the digital divide and promoting greater linguistic diversity in natural language processing applications. This study introduces a novel method that uses the K-Nearest Neighbors (KNN) algorithm to transcribe spoken Bangla, a language spoken mainly in Bangladesh and West Bengal, India, into text. The method relies on KNN's pattern-recognition ability to map audio data to linguistic representations. To capture the discriminative aspects of spoken language, we extract Mel-frequency cepstral coefficients (MFCCs), which are key audio features. To improve the model's robustness, our dataset includes a wide range of Bangla speech samples covering regional dialects and accents. This work contributes to the broader goal of democratizing speech technology for regional languages, empowering non-English-speaking communities, and making information accessible to a wider audience. The proposed KNN-based system opens the door to diverse applications of regional language interfaces.
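The abstract does not state the value of k, the distance metric, or the unit being recognised, so the following is only a minimal sketch of the general idea: a majority-vote KNN classifier over fixed-length feature vectors. It assumes each utterance has already been reduced to one vector (for example, MFCCs averaged over time, computed upstream). The function names, the Euclidean distance, k=3, the toy vectors, and the romanised word labels are all illustrative assumptions, not the authors' implementation or data.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train, query, k=3):
    """Return the majority label among the k training vectors nearest to query.

    train: list of (feature_vector, label) pairs.
    query: feature vector of the utterance to label.
    """
    if k < 1 or k > len(train):
        raise ValueError("k must be between 1 and the number of training samples")
    # Sort training samples by distance to the query and keep the k closest.
    neighbours = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: 3-dimensional stand-ins for averaged MFCC vectors,
# labelled with romanised placeholder words. Real MFCC vectors are typically
# longer and would come from an audio feature extractor.
TRAIN = [
    ([1.0, 0.2, -0.5], "ami"),
    ([0.9, 0.1, -0.4], "ami"),
    ([1.1, 0.3, -0.6], "ami"),
    ([-0.8, 1.5, 0.7], "tumi"),
    ([-0.9, 1.4, 0.8], "tumi"),
    ([-1.0, 1.6, 0.6], "tumi"),
]

if __name__ == "__main__":
    print(knn_predict(TRAIN, [0.95, 0.15, -0.45], k=3))
```

In this sketch the text output is simply the label attached to the nearest training examples, so vocabulary coverage is limited to whatever labels appear in the training set.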

Original language: English
Title of host publication: Proceedings - 3rd International Conference on Self Sustainable Artificial Intelligence Systems, ICSSAS 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1716-1720
Number of pages: 5
ISBN (Electronic): 9798331538842
Publication status: Published - 2025
Event: 3rd International Conference on Self Sustainable Artificial Intelligence Systems, ICSSAS 2025 - Erode, India
Duration: 11-06-2025 to 13-06-2025

Publication series

Name: Proceedings - 3rd International Conference on Self Sustainable Artificial Intelligence Systems, ICSSAS 2025

Conference

Conference: 3rd International Conference on Self Sustainable Artificial Intelligence Systems, ICSSAS 2025
Country/Territory: India
City: Erode
Period: 11-06-25 to 13-06-25

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Information Systems
  • Computational Mathematics
  • Control and Optimization
  • Theoretical Computer Science