Preventing Posterior Collapse with delta-VAEs

Ali Razavi; Aaron van den Oord; Ben Poole; Oriol Vinyals

Preventing Posterior Collapse with delta-VAEs

Ali Razavi, Aaron van den Oord, Ben Poole, Oriol Vinyals

Published: 21 Dec 2018, Last Modified: 22 Jun 2025ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: Due to the phenomenon of “posterior collapse,” current latent variable generative models pose a challenging design choice that either weakens the capacity of the decoder or requires altering the training objective. We develop an alternative that utilizes the most powerful generative models as decoders, optimize the variational lower bound, and ensures that the latent variables preserve and encode useful information. Our proposed δ-VAEs achieve this by constraining the variational family for the posterior to have a minimum distance to the prior. For sequential latent variable models, our approach resembles the classic representation learning approach of slow feature analysis. We demonstrate our method’s efficacy at modeling text on LM1B and modeling images: learning representations, improving sample quality, and achieving state of the art log-likelihood on CIFAR-10 and ImageNet 32 × 32.

Keywords: Posterior Collapse, VAE, Autoregressive Models

TL;DR: Avoid posterior collapse by lower bounding the rate.

Data: [CIFAR-10](https://paperswithcode.com/dataset/cifar-10), [ImageNet](https://paperswithcode.com/dataset/imagenet)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/preventing-posterior-collapse-with-delta-vaes/code)

11 Replies

Loading