Skip to main navigation Skip to search Skip to main content

Analysis of Frequencies and Pitches for Vocal Cord Paralysis Classification Through Transfer Learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Audio-based examination for Vocal Cord Paralysis (VCP) is a useful, non-invasive, cost-effective procedure, and Spectrogram analysis is one such boon for the classification of VCP versus Healthy subjects. However, spectrograms have not been utilized fully from their representation for classification purposes. This research aims to investigate spectrograms at various frequencies over different pitches to aid in accurately classifying VCP and healthy subjects. A voice disorders database named Saarbruecken Voice Database (SVD) contains audios of VCP and healthy subjects are collected. Using Transfer Learning (TL) and fine-tuning, Deep Convolutional Neural Networks (DCNNs) and Vision Transformers (ViTs) architectures are trained over the vowel /a/, and the derived characteristics are given for classification, yielding outstanding 100% accuracy. With minimal trainable parameters, the Deep Learning (DL) architectures performed exceptionally well, indicating the computational capability of the spectrograms in terms of robustness for pathological practice in the future.

Original languageEnglish
Title of host publicationProceedings of CONECCT 2024 - 10th IEEE International Conference on Electronics, Computing and Communication Technologies
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350385922
DOIs
Publication statusPublished - 2024
Event10th IEEE International Conference on Electronics, Computing and Communication Technologies, CONECCT 2024 - Bangalore, India
Duration: 12-07-202414-07-2024

Publication series

NameProceedings of CONECCT 2024 - 10th IEEE International Conference on Electronics, Computing and Communication Technologies

Conference

Conference10th IEEE International Conference on Electronics, Computing and Communication Technologies, CONECCT 2024
Country/TerritoryIndia
CityBangalore
Period12-07-2414-07-24

All Science Journal Classification (ASJC) codes

  • Control and Optimization
  • Instrumentation
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems and Management
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Analysis of Frequencies and Pitches for Vocal Cord Paralysis Classification Through Transfer Learning'. Together they form a unique fingerprint.

Cite this