Generative semi-supervised learning with a neural seq2seq noisy channel

Published: 19 Jun 2023, Last Modified: 28 Jul 20231st SPIGM @ ICML PosterEveryoneRevisionsBibTeX
Keywords: semi-supervised, generative modeling, seq2seq, paired data, parallel data, noisy channel
TL;DR: Principled neural seq2seq generative semi-supervised learning which works.
Abstract: We use a neural noisy channel generative model to learn the relationship between two sequences, for example text and speech, from little paired data. We identify time locality as a key assumption which is restrictive enough to support semi-supervised learning but general enough to be widely applicable. Experimentally we show that our approach is capable of recovering the relationship between written and spoken language (represented as graphemes and phonemes) from only 5 minutes of paired data. Our results pave the way for more widespread adoption of generative semi-supervised learning for seq2seq tasks.
Submission Number: 123
Loading