Performance analysis of LPC and MFCC features in voice conversion using artificial neural networks

Shashidhar G. Koolagudi*, B. Kavya Vishwanath, M. Akshatha, Yarlagadda V.S. Murthy

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    5 Citations (Scopus)

    Abstract

    Voice Conversion is a technique in which source speakers voice is morphed to a target speakers voice by learning source–target relationship from a number of utterances from source and the target. There are many applications which may benefit from this sort of technology for example dubbing movies, TV-shows, TTS systems and so on. In this paper, analysis on the performance of ANN-based Voice Conversion system is done using linear predictive coding (LPC) and mel-frequency cepstral coefficients (MFCCs). Experimental results show that Voice Conversion system based on LPC features is better than the ones based on MFCC features.

    Original languageEnglish
    Title of host publicationProceedings of the International Conference on Data Engineering and Communication Technology
    EditorsVikrant Bhateja, Suresh Chandra Satapathy, Amit Joshi
    PublisherSpringer Verlag
    Pages275-280
    Number of pages6
    ISBN (Print)9789811016776
    DOIs
    Publication statusPublished - 2017
    Event1st International Conference on Data Engineering and Communication Technology, ICDECT 2016 - Lavasa City, Pune, India
    Duration: 10-03-201611-03-2016

    Publication series

    NameAdvances in Intelligent Systems and Computing
    Volume469
    ISSN (Print)2194-5357

    Conference

    Conference1st International Conference on Data Engineering and Communication Technology, ICDECT 2016
    Country/TerritoryIndia
    CityLavasa City, Pune
    Period10-03-1611-03-16

    All Science Journal Classification (ASJC) codes

    • Control and Systems Engineering
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Performance analysis of LPC and MFCC features in voice conversion using artificial neural networks'. Together they form a unique fingerprint.

    Cite this