Vision transformer-powered conversational agent for real-time Indian Sign Language e-governance accessibility

  • Prema Nedungadi*
  • , Gautham Dileep*
  • , M. Geetha*
  • , Raghu Raman
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a vision transformer-based, context-aware, real-time Indian Sign Language (ISL) conversational agent designed to enhance digital accessibility for India’s Deaf and Hard-of-Hearing community within e-governance services. The system supports continuous sign language recognition, from isolated words to full ISL sentences, and delivers information and services directly in sign language. A domain-specific ISL video dataset, incorporating diverse signing styles and environments, addresses ISL’s low-resource challenges, enabling robust, scalable real-world deployment. The hybrid architecture, combining convolutional neural networks with vision transformer models, effectively handles ISL’s spatial–temporal complexities, maintaining reliability even with complex queries. A dynamic context-response mapping engine uses contextual data to increase accuracy, particularly for ambiguous inputs. The modular design ensures efficient scalability, facilitating seamless integration of new services. System evaluations, including stress testing and usability studies, confirmed its effectiveness in enabling real-time, inclusive digital interactions. Under optimized conditions, the system achieved 97.5% accuracy, a mean response time of 6.53 s, and an average system usability score of 71.83, significantly advancing digital inclusion in India.

Original languageEnglish
Article number33055
JournalScientific Reports
Volume15
Issue number1
DOIs
Publication statusPublished - 12-2025

All Science Journal Classification (ASJC) codes

  • General

Fingerprint

Dive into the research topics of 'Vision transformer-powered conversational agent for real-time Indian Sign Language e-governance accessibility'. Together they form a unique fingerprint.

Cite this