Learning to Pronounce Chinese Without a Pronunciation DictionaryDownload PDFOpen Website

2020 (modified: 05 Nov 2021)CoRR 2020Readers: Everyone
Abstract: We demonstrate a program that learns to pronounce Chinese text in Mandarin, without a pronunciation dictionary. From non-parallel streams of Chinese characters and Chinese pinyin syllables, it establishes a many-to-many mapping between characters and pronunciations. Using unsupervised methods, the program effectively deciphers writing into speech. Its token-level character-to-syllable accuracy is 89%, which significantly exceeds the 22% accuracy of prior work.
0 Replies

Loading