A Framework for Recording Audio-Visual Speech Corpora with a Microphone and a High-Speed Camera

Published: 01 Jan 2014, Last Modified: 28 Mar 2025. Venue: SPECOM 2014. License: CC BY-SA 4.0
Abstract: In this paper, we present a novel software framework for recording audio-visual speech corpora with a high-speed video camera (JAI Pulnix RMC 6740) and a dynamic microphone (Oktava MK-012). We describe the architecture of the framework, which was developed for recording an audio-visual Russian speech corpus. The framework synchronizes and fuses the audio and video data captured by the two independent sensors. It automatically detects voice activity in the audio signal and stores only the speech fragments, discarding non-informative signals. It also takes into account and processes the natural asynchrony between the audio and visual speech modalities.
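
The voice-activity step and the mapping of detected speech onto video frames can be illustrated with a minimal sketch. This is not the framework's implementation: the energy-threshold detector, the function names (`frame_energies`, `detect_speech`, `to_video_frames`), the threshold, the frame length, and the `margin_sec` padding are all assumptions chosen for illustration. The padding is only one simple way to keep lip movements that start before the sound; the abstract does not say how the framework itself handles asynchrony.

```python
import math

def frame_energies(samples, frame_len):
    """Mean-square energy of consecutive non-overlapping audio frames."""
    return [
        sum(s * s for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]

def detect_speech(samples, sample_rate, frame_ms=10, threshold=1e-3):
    """Return (start_sec, end_sec) spans whose frame energy exceeds threshold.

    Assumption: a fixed energy threshold stands in for a real detector.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    energies = frame_energies(samples, frame_len)
    segments, start = [], None
    for idx, energy in enumerate(energies):
        t = idx * frame_len / sample_rate
        if energy > threshold and start is None:
            start = t                      # speech begins
        elif energy <= threshold and start is not None:
            segments.append((start, t))    # speech ends
            start = None
    if start is not None:                  # signal ended during speech
        segments.append((start, len(energies) * frame_len / sample_rate))
    return segments

def to_video_frames(segment, fps, margin_sec=0.0):
    """Map an audio span to (first, last) video frame indices; last is exclusive.

    Assumption: audio and video share a common time origin, and margin_sec
    widens the span to tolerate audio-visual asynchrony.
    """
    start, end = segment
    first = max(0, math.floor((start - margin_sec) * fps))
    last = math.ceil((end + margin_sec) * fps)
    return first, last
```

For example, with a hypothetical recording sampled at 8 kHz and a camera running at an assumed 200 frames per second, `detect_speech` yields the speech spans in seconds and `to_video_frames` gives the range of video frames to store alongside each span.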