Dear reviewer,

Please open the file "index.html" in the browser of your choice. It contains three sections:
1. Stimuli from the evaluation test
	- Samples presented to participants during the subjective evaluation
2. Additional examples from the proposed system (Diff-TTSG)
3. Examples showing the importance of the diffusion model
	- To illustrate why diffusion matters for modelling both speech and motion, these stimuli compare synthesis from condition D-TTSG with synthesis directly from the μ values predicted by the D-TTSG decoder and Conformer, i.e. with the diffusion sampling step skipped.