Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

2021 (modified: 29 Sept 2021), ICML 2021, Readers: Everyone
Abstract: Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems....