TY - GEN
T1 - Review of Toolkit to Build Automatic Speech Recognition Models
AU - Raghudathesh, G. P.
AU - Chandrakala, C. B.
AU - Rao, B. Dinesh
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2023
Y1 - 2023
N2 - Speech is one of the most significant types of communication between human beings. It is beginning to be a preferred means for communication between machines and humans. The mechanism of transforming human speech into its equivalent textual format is known as speech recognition. Various toolkits are being used to automate the process of speech-to-text conversion, and this process is referred to as automatic speech recognition (ASR). The usage of the ASR system is becoming prevalent with the implementation of human–machine interaction. Numerous speech-based assistive systems are available today used in several different areas. This paper provides insight into the ASR domain and toolkit used in ASR system—HTK, CMU Sphinx, Kaldi, and Julius with their comparative analysis in terms of installation, ease of use, and accuracy assessment.
AB - Speech is one of the most significant types of communication between human beings. It is beginning to be a preferred means for communication between machines and humans. The mechanism of transforming human speech into its equivalent textual format is known as speech recognition. Various toolkits are being used to automate the process of speech-to-text conversion, and this process is referred to as automatic speech recognition (ASR). The usage of the ASR system is becoming prevalent with the implementation of human–machine interaction. Numerous speech-based assistive systems are available today used in several different areas. This paper provides insight into the ASR domain and toolkit used in ASR system—HTK, CMU Sphinx, Kaldi, and Julius with their comparative analysis in terms of installation, ease of use, and accuracy assessment.
UR - http://www.scopus.com/inward/record.url?scp=85138805290&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138805290&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-4052-1_45
DO - 10.1007/978-981-19-4052-1_45
M3 - Conference contribution
AN - SCOPUS:85138805290
SN - 9789811940514
T3 - Lecture Notes in Networks and Systems
SP - 449
EP - 459
BT - Emerging Technologies in Data Mining and Information Security - Proceedings of IEMIS 2022
A2 - Dutta, Paramartha
A2 - Chakrabarti, Satyajit
A2 - Bhattacharya, Abhishek
A2 - Dutta, Soumi
A2 - Shahnaz, Celia
PB - Springer Science and Business Media Deutschland GmbH
T2 - 3rd International Conference on Emerging Technologies in Data Mining and Information Security, IEMIS 2022
Y2 - 23 February 2022 through 25 February 2022
ER -