Perceptual Generative Autoencoders

Zijun Zhang; Ruixiang Zhang; Zongpeng Li; Yoshua Bengio; Liam Paull

Perceptual Generative Autoencoders

Zijun Zhang, Ruixiang Zhang, Zongpeng Li, Yoshua Bengio, Liam Paull

Published: 03 May 2019, Last Modified: 22 Jun 2025DeepGenStruct 2019Readers: Everyone

TL;DR: A framework for training autoencoder-based generative models, with non-adversarial losses and unrestricted neural network architectures.

Abstract: Modern generative models are usually designed to match target distributions directly in the data space, where the intrinsic dimensionality of data can be much lower than the ambient dimensionality. We argue that this discrepancy may contribute to the difficulties in training generative models. We therefore propose to map both the generated and target distributions to the latent space using the encoder of a standard autoencoder, and train the generator (or decoder) to match the target distribution in the latent space. The resulting method, perceptual generative autoencoder (PGA), is then incorporated with maximum likelihood or variational autoencoder (VAE) objective to train the generative model. With maximum likelihood, PGA generalizes the idea of reversible generative models to unrestricted neural network architectures and arbitrary latent dimensionalities. When combined with VAE, PGA can generate sharper samples than vanilla VAE.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/perceptual-generative-autoencoders/code)

3 Replies

Loading