Speaker-conditioned phrase break prediction for text-to-speech with phoneme-level pre-trained language model.

Dong Yang, Yuki Saito, Takaaki Saeki, Tomoki Koriyama, Wataru Nakata, Detai Xin, Hiroshi Saruwatari

18 Mar 2026 | Speech Commun. 2026 | Everyone | CC BY-SA 4.0