Abstract: Highlights
• Always-listening recognition pipeline for multi-room smart spaces.
• Room-localized operation based on multi-room speech activity detection.
• Channel selection and decision fusion approaches in all pipeline components.
• Robust acoustic modeling based on far-field data simulation and per-channel adaptation.
• Systematic pipeline evaluation and optimization on both simulated and real corpora.