Review of Toolkit to Build Automatic Speech Recognition Models

G. P. Raghudathesh, C. B. Chandrakala, B. Dinesh Rao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Speech is one of the most significant types of communication between human beings. It is beginning to be a preferred means for communication between machines and humans. The mechanism of transforming human speech into its equivalent textual format is known as speech recognition. Various toolkits are being used to automate the process of speech-to-text conversion, and this process is referred to as automatic speech recognition (ASR). The usage of the ASR system is becoming prevalent with the implementation of human–machine interaction. Numerous speech-based assistive systems are available today used in several different areas. This paper provides insight into the ASR domain and toolkit used in ASR system—HTK, CMU Sphinx, Kaldi, and Julius with their comparative analysis in terms of installation, ease of use, and accuracy assessment.

Original languageEnglish
Title of host publicationEmerging Technologies in Data Mining and Information Security - Proceedings of IEMIS 2022
EditorsParamartha Dutta, Satyajit Chakrabarti, Abhishek Bhattacharya, Soumi Dutta, Celia Shahnaz
PublisherSpringer Science and Business Media Deutschland GmbH
Pages449-459
Number of pages11
ISBN (Print)9789811940514
DOIs
Publication statusPublished - 2023
Event3rd International Conference on Emerging Technologies in Data Mining and Information Security, IEMIS 2022 - Kolkata, India
Duration: 23-02-202225-02-2022

Publication series

NameLecture Notes in Networks and Systems
Volume490
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference3rd International Conference on Emerging Technologies in Data Mining and Information Security, IEMIS 2022
Country/TerritoryIndia
CityKolkata
Period23-02-2225-02-22

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Review of Toolkit to Build Automatic Speech Recognition Models'. Together they form a unique fingerprint.

Cite this