Abstract: A hybrid coder is introduced for obtaining descriptions of speech patterns. This coder uses vector quantization (VQ) techniques on mel-scale cepstral coefficients and their derivatives together with a recurrent network (RN) for describing suprasegmental features of speech. The purpose of these features is to focus the search when hidden Markov models (HMMs) are used for speech unit or word models. Preliminary experiments of speaker-independent connected digit recognition show that using a hybrid coder based on a RN improves recognition performance.<
0 Replies
Loading