A two-microphone based voice activity detection for distant-talking speech in wide range of direction of arrival

Published: 01 Jan 2012, Last Modified: 15 May 2025. Venue: ICASSP 2012. License: CC BY-SA 4.0
Abstract: This paper proposes a two-microphone voice activity detection (VAD) algorithm for detecting distant-talking speech that may arrive from anywhere within a wide range of directions of arrival (DOA). The long-term information of inter-channel phase difference (LTIPD) is introduced as a measure of target speech presence; it describes how strongly the DOA estimates concentrate on a sound source with harmonic structure. The proposed algorithm performs robustly on distant-talking speech recorded in several real environments.
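The abstract does not give the definition of LTIPD, so the following is only a rough sketch of the general idea it builds on, not the paper's algorithm. It assumes a far-field source, a known microphone spacing, and a single analysis frame (the long-term aggregation is not modeled). Per-bin inter-channel phase differences are mapped to DOA estimates, and their concentration is scored by a power-weighted mean resultant length. The function name `doa_concentration` and all parameters are hypothetical.

```python
import numpy as np

def doa_concentration(frame1, frame2, fs, mic_distance, c=343.0):
    """Illustrative only: per-bin DOA from inter-channel phase difference.

    Returns (mean_doa_rad, concentration), with concentration in [0, 1].
    Assumes a far-field source and two equal-length frames.
    """
    n = len(frame1)
    window = np.hanning(n)
    X1 = np.fft.rfft(np.asarray(frame1) * window)
    X2 = np.fft.rfft(np.asarray(frame2) * window)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)

    # Keep bins below the spatial-aliasing limit c / (2 d), skipping DC.
    valid = (freqs > 0) & (freqs < c / (2.0 * mic_distance))
    cross = X1[valid] * np.conj(X2[valid])

    # Phase difference 2*pi*f*tau, where tau = d * sin(theta) / c.
    ipd = np.angle(cross)
    sin_theta = np.clip(ipd * c / (2.0 * np.pi * freqs[valid] * mic_distance),
                        -1.0, 1.0)
    doa = np.arcsin(sin_theta)

    # Power-weighted mean resultant length of the per-bin DOA estimates:
    # close to 1 when the estimates agree, lower when they scatter.
    weights = np.abs(cross)
    resultant = np.sum(weights * np.exp(1j * doa)) / np.sum(weights)
    return float(np.angle(resultant)), float(np.abs(resultant))


def harmonic_pair(theta, fs=16000, n=512, f0=250.0, mic_distance=0.1, c=343.0):
    """Synthetic harmonic source; channel 2 is delayed by d*sin(theta)/c."""
    t = np.arange(n) / fs
    tau = mic_distance * np.sin(theta) / c
    harmonics = f0 * np.arange(1, 7)  # 250 Hz ... 1500 Hz
    x1 = sum(np.sin(2 * np.pi * f * t) for f in harmonics)
    x2 = sum(np.sin(2 * np.pi * f * (t - tau)) for f in harmonics)
    return x1, x2
```

For a harmonic source at a fixed angle, the per-bin estimates should cluster and the score should approach 1; for two uncorrelated noise channels, the estimates scatter and the score drops. A detector along these lines would threshold the score, but the threshold and any temporal smoothing are design choices not specified in the abstract.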