Skip to main navigation Skip to search Skip to main content

Performance Evaluation of Pre-Trained Models for Classification of Vocal Cord Paralysis over Vowels

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper intends to explore the use of various Deep Convolutional Neural Networks(DCNNs) and Vision Transformers(ViTs) in the classification of Vocal Cord Paralysis(VCP). A non-invasive approach to classify VCP using spectrograms can be an effective viewpoint for scrutinizing. From the Saarbruecken Voice Database (SVD) database, audio samples of the vowels /a/, / i /, and / u / at various pitches are considered and sampled at different frequencies. Utilizing the Transfer Learning(TL) along with various models, a remarkable accuracy of 93% for ViT-B/16 and 88% for MobileNet is observed for the vowel/a/ at 44.1 KHz. The vowels /u/ and /i/ also performed exceptionally well with an accuracy of 90% at 50 KHz and 87% at 16 KHz, respectively, expressing the distinguishing capability of the models over an imbalanced and limited dataset, which controlled the notion of the need for humongous input. As a result, tuning the frequency and vowels can help automate the analysis of the voice-associated pathology VCP.

Original languageEnglish
Title of host publication2nd IEEE International Conference on Networks, Multimedia and Information Technology, NMITCON 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350372892
DOIs
Publication statusPublished - 2024
Event2nd IEEE International Conference on Networks, Multimedia and Information Technology, NMITCON 2024 - Hybrid, Bengaluru, India
Duration: 09-08-202410-08-2024

Publication series

Name2nd IEEE International Conference on Networks, Multimedia and Information Technology, NMITCON 2024

Conference

Conference2nd IEEE International Conference on Networks, Multimedia and Information Technology, NMITCON 2024
Country/TerritoryIndia
CityHybrid, Bengaluru
Period09-08-2410-08-24

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Media Technology

Fingerprint

Dive into the research topics of 'Performance Evaluation of Pre-Trained Models for Classification of Vocal Cord Paralysis over Vowels'. Together they form a unique fingerprint.

Cite this