Integrating online i-vector into GMM-UBM for text-dependent speaker verification

Published: 01 Jan 2017, Last Modified: 22 May 2024APSIPA 2017EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: GMM-UBM is widely used for the text-dependent task for its simplicity and effectiveness, while i-vector provides a compact representation for speaker information. Thus it is promising to fuse these two frameworks. In this paper, a variation of traditional i-vector extracted at frame level is appended with MFCC as tandem features. Incorporating this feature into GMM-UBM system achieves 26% and 41% performance gain compared with DNN i-vector baseline on the RSR2015 and RedDots evaluation set, respectively. Moreover, the performance of the proposed system that trained on 86 hours data is on par with that of the DNN i-vector baseline trained on a much larger dataset (5000 hours).
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview