Deep Learning Based Automated Lip Reading for Deaf

  • Prathyakshini
  • , Prathwini*
  • , N. Pratheeksha Hegde
  • , Vaishali
  • , N. Rashmi
  • , Archana Praveen Kumar
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Speech recognition systems play an integral role in numerous applications, from virtual assistants to accessibility tools. This paper offers a new viewpoint to speech recognition utilizing computer vision and deep learning techniques. The proposed system is trained on a sizable dataset comprising 700 video clips of individuals uttering predefined words, add up to approximately 3 GB of data. Leveraging TensorFlow and Keras, model architecture is designed incorporating convolutional and dense layers. The training process yielded promising results, with a training accuracy of 95.7% and a validation accuracy of 98.5%, indicative of robust classification performance. The integration of computer vision enriches the system's ability to extract meaningful features from audio-visual inputs, enhancing its overall recognition accuracy. Proposed method demonstrates significant potential for real-time speech recognition applications in different areas, like human-computer interaction and assistive technologies.

Original languageEnglish
Title of host publication2024 3rd International Conference for Advancement in Technology, ICONAT 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350354171
DOIs
Publication statusPublished - 2024
Event3rd International Conference for Advancement in Technology, ICONAT 2024 - Goa, India
Duration: 13-09-202414-09-2024

Publication series

Name2024 3rd International Conference for Advancement in Technology, ICONAT 2024

Conference

Conference3rd International Conference for Advancement in Technology, ICONAT 2024
Country/TerritoryIndia
CityGoa
Period13-09-2414-09-24

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering
  • Control and Optimization
  • Instrumentation

Fingerprint

Dive into the research topics of 'Deep Learning Based Automated Lip Reading for Deaf'. Together they form a unique fingerprint.

Cite this