Extracting Additional Information from Gaussian Mixture Model Probabilities for Improved Text-Independent Speaker Identification

Abstract: This paper addresses the problem of robust text-independent speaker identification. A voting mechanism is proposed to combine probabilities generated using Gaussian mixture models (GMMs). This algorithm is evaluated on standard data sets and shown to improve performance. This method is found to decrease error rate by up to 68.6% relative on KING database and 34.9% relative on SPIDRE. An analysis is performed and a hypothesis is proposed as to why this algorithm does not give as good an identification rate in certain cases. A method of using voting along with the standard GMM method is described which overcomes this limitation. This second method is evaluated and found to decrease error rate by as much as 45.67% relative on the SPIDRE databases. It is found to give a substantial improvement over conventional GMMs in all the experiments performed. Both the proposed algorithms achieve increased accuracy with negligible increase in computational cost.
0 Replies
Loading