TY - JOUR
T1 - A technique for noise robust voice activity detection under uncontrolled environment
AU - G, Nagaraja B.
AU - G, Thimmaraja Yadava
AU - Kabballi, Prashanth
AU - P, Raghudathesh G.
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2025/6
Y1 - 2025/6
N2 - Voice activity detection (VAD) is a critical component in speech processing systems. Traditional VAD methods work well in clean and controlled environments but perform poorly in real-life scenarios where various noise sources interfere with the speech signal. This article presents a technique for achieving noise robust VAD under such adverse conditions. The proposed system consists of a background noise suppression module based on the minimum mean square error spectrum power estimator using zero crossing (MMSE-SPZC), which is added before vector quantization-based VAD (VQ-VAD). Through extensive experimentation on the NOIZEUS and NIST-SRE10 databases with varying levels of noise, the effectiveness of the proposed technique is demonstrated. The results indicate substantial improvements in VAD accuracy, even in the presence of background noise. We provide an open-source implementation of the method. https://sites.google.com/view/thimmarajayadavag/downloads.
AB - Voice activity detection (VAD) is a critical component in speech processing systems. Traditional VAD methods work well in clean and controlled environments but perform poorly in real-life scenarios where various noise sources interfere with the speech signal. This article presents a technique for achieving noise robust VAD under such adverse conditions. The proposed system consists of a background noise suppression module based on the minimum mean square error spectrum power estimator using zero crossing (MMSE-SPZC), which is added before vector quantization-based VAD (VQ-VAD). Through extensive experimentation on the NOIZEUS and NIST-SRE10 databases with varying levels of noise, the effectiveness of the proposed technique is demonstrated. The results indicate substantial improvements in VAD accuracy, even in the presence of background noise. We provide an open-source implementation of the method. https://sites.google.com/view/thimmarajayadavag/downloads.
UR - https://www.scopus.com/pages/publications/85200157485
UR - https://www.scopus.com/inward/citedby.url?scp=85200157485&partnerID=8YFLogxK
U2 - 10.1007/s11042-024-19960-9
DO - 10.1007/s11042-024-19960-9
M3 - Article
AN - SCOPUS:85200157485
SN - 1380-7501
VL - 84
SP - 22069
EP - 22081
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 20
ER -