2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract:Most modern text-to-speech architectures use a WaveNet vocoder for synthesizing high-fidelity waveform audio, but there have been limitations, such as high inference time, in practical applications...