TY - GEN
T1 - Multilingual Music Deepfake Detection Using WavLM and MERT Audio Embeddings
AU - Ponnada, Tejaswini
AU - Agarwal, Pawani
AU - Ignisha Rajathi, G.
AU - Yasir Abdullah, R.
AU - Mohanalin, J.
AU - Johny Elton, R.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - With the growth in AI-generated material, a new issue has emerged in the form of deepfake music. These deepfakes are constructed with advanced models that can mimic human sounds or music in a variety of languages and accents. In this paper, we investigate the identification of multilingual music deepfakes - leveraging data from four languages: German, English, Spanish, and French. Regardless of the language or accent used, we aim to develop a system that can recognize phony music clips. We propose a novel framework that combines embeddings from WavLM-base-plus-sv, a speech and speaker-centric model, and MERT-v1-330M, an acoustic music understanding transformer, with a lightweight multilayer perceptron classifier to distinguish between real and deepfake music samples.
AB - With the growth in AI-generated material, a new issue has emerged in the form of deepfake music. These deepfakes are constructed with advanced models that can mimic human sounds or music in a variety of languages and accents. In this paper, we investigate the identification of multilingual music deepfakes - leveraging data from four languages: German, English, Spanish, and French. Regardless of the language or accent used, we aim to develop a system that can recognize phony music clips. We propose a novel framework that combines embeddings from WavLM-base-plus-sv, a speech and speaker-centric model, and MERT-v1-330M, an acoustic music understanding transformer, with a lightweight multilayer perceptron classifier to distinguish between real and deepfake music samples.
UR - https://www.scopus.com/pages/publications/105036971787
UR - https://www.scopus.com/pages/publications/105036971787#tab=citedBy
U2 - 10.1007/978-3-032-22059-2_25
DO - 10.1007/978-3-032-22059-2_25
M3 - Conference contribution
AN - SCOPUS:105036971787
SN - 9783032220585
T3 - Communications in Computer and Information Science
SP - 328
EP - 340
BT - Soft Computing and Its Engineering Applications - 7th International Conference, icSoftComp 2025, Proceedings
A2 - Patel, Kanubhai K.
A2 - Patel, Atul
A2 - Santosh, KC
A2 - Gomes de Oliveira, Gabriel
A2 - Ghosh, Ashish
PB - Springer Science and Business Media Deutschland GmbH
T2 - 7th International Conference on Soft Computing and its Engineering Applications, icSoftComp 2025
Y2 - 9 December 2025 through 11 December 2025
ER -