TY - JOUR
T1 - Vocal fold pathology assessment using mel-frequency cepstral coefficients and linear predictive cepstral coefficients features
AU - Saldanha, Jennifer C.
AU - Ananthakrishna, T.
AU - Pinto, Rohan
PY - 2014
Y1 - 2014
N2 - It is possible to identify voice disorders using certain features of speech signals. A complementary technique could be acoustic analysis of the speech signal, which is shown to be a potentially useful tool to detect voice diseases. The focus of this study is to compare the performances of mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC) features in the detection of vocal fold pathology and also bring out scale to measure severity of the disease. The speech processing algorithm proposed estimates features necessary to formulate a stochastic model to characterize healthy and pathology conditions from speech recordings. Two different set of features such as MFCC and LPCC are extracted from acoustic analysis of voiced speech of normal and pathological subjects. A linear discriminant analysis (LDA) classifier, Principal component analysis (PCA) + Minimum distance classifier (MDC), Principal component analysis (PCA) + k-Nearest Neighbor (k-NN) classifier, PCA + LDA classifiers are designed and the classification results have been reported.
AB - It is possible to identify voice disorders using certain features of speech signals. A complementary technique could be acoustic analysis of the speech signal, which is shown to be a potentially useful tool to detect voice diseases. The focus of this study is to compare the performances of mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC) features in the detection of vocal fold pathology and also bring out scale to measure severity of the disease. The speech processing algorithm proposed estimates features necessary to formulate a stochastic model to characterize healthy and pathology conditions from speech recordings. Two different set of features such as MFCC and LPCC are extracted from acoustic analysis of voiced speech of normal and pathological subjects. A linear discriminant analysis (LDA) classifier, Principal component analysis (PCA) + Minimum distance classifier (MDC), Principal component analysis (PCA) + k-Nearest Neighbor (k-NN) classifier, PCA + LDA classifiers are designed and the classification results have been reported.
UR - http://www.scopus.com/inward/record.url?scp=84896910648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84896910648&partnerID=8YFLogxK
U2 - 10.1166/jmihi.2014.1253
DO - 10.1166/jmihi.2014.1253
M3 - Article
AN - SCOPUS:84896910648
SN - 2156-7018
VL - 4
SP - 168
EP - 173
JO - Journal of Medical Imaging and Health Informatics
JF - Journal of Medical Imaging and Health Informatics
IS - 2
ER -