Novel temporal and spectral features derived from TEO for classification normal and dysphonic voices

Hemant A. Patil, Pallavi N. Baljekar, T. K. Basu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, various temporal features (i.e., zero crossing rate and short-time energy) and spectral features (spectral flux and spectral centroid) have been derived from the Teager energy operator (TEO) profile of the speech waveform. The efficacy of these features has been analyzed for the classification of normal and dysphonic voices by comparing their performance with the features derived from the linear prediction (LP) residual and the speech waveform. In addition, the effectiveness of fusing these features with state-of-the-art Mel frequency cepstral coefficients (MFCC) feature-set has also been investigated to understand whether these features provide complementary results. The classifier that has been used is the 2nd order polynomial classifier, with experiments being carried out on a subset of the Massachusetts Eye and Ear Infirmary (MEEI) database.

Original languageEnglish
Title of host publicationFrontiers in Computer Education
Pages559-567
Number of pages9
DOIs
Publication statusPublished - 24-05-2012
Externally publishedYes
Event2011 International Conference on Frontiers in Computer Education, ICFCE 2011 - Macao, China
Duration: 01-12-201102-12-2011

Publication series

NameAdvances in Intelligent and Soft Computing
Volume133 AISC
ISSN (Print)1867-5662

Conference

Conference2011 International Conference on Frontiers in Computer Education, ICFCE 2011
Country/TerritoryChina
CityMacao
Period01-12-1102-12-11

All Science Journal Classification (ASJC) codes

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Novel temporal and spectral features derived from TEO for classification normal and dysphonic voices'. Together they form a unique fingerprint.

Cite this