HiddenSinger: High-quality singing voice synthesis via neural audio codec and latent diffusion models
Abstract: Highlights•Introduce HiddenSinger, a high-quality singing voice synthesis model.•Propose HiddenSinger-U, an unsupervised learning framework to train with unlabeled datasets.•Audio samples are available at https://jisang93.github.io/hiddensinger-demo/.
Loading