Messages

This page shows the samples in the paper "Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion".

Experiments were based on LJSpeech and LibriTTS.

The framework of LinDiff:

Single Speaker (LJSpeech Dataset)

GT WaveNet WaveGlow HIFI-GAN WaveGrad(6) FastDiff(4) LinDiff(1) LinDiff(3) LinDiff(100)

Multi Speakers (LibriTTS Dataset)

GT WaveNet WaveGlow HIFI-GAN WaveGrad(6) FastDiff(4) LinDiff(1) LinDiff(3) LinDiff(100)