Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

R. J. Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous

2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone

Abstract: We present an extension to the Tacotron speech synthesis architecture that learns a latent embedding space of prosody, derived from a reference acoustic representation containing the desired prosod...

0 Replies