Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Gabriel Loaiza-Ganem; Brendan Leigh Ross; Jesse C Cresswell; Anthony L. Caterini

Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Gabriel Loaiza-Ganem, Brendan Leigh Ross, Jesse C Cresswell, Anthony L. Caterini

Published: 07 Aug 2022, Last Modified: 17 Sept 2024Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0

Authors that are also TMLR Expert Reviewers: ~Gabriel_Loaiza-Ganem1

Abstract: Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this dimensionality mismatch. We formally prove that degenerate optima are achieved wherein the manifold itself is learned but not the distribution on it, a phenomenon we call manifold overfitting. We propose a class of two-step procedures consisting of a dimensionality reduction step followed by maximum-likelihood density estimation, and prove that they recover the data-generating distribution in the nonparametric regime, thus avoiding manifold overfitting. We also show that these procedures enable density estimation on the manifolds learned by implicit models, such as generative adversarial networks, hence addressing a major shortcoming of these models. Several recently proposed methods are instances of our two-step procedures; we thus unify, extend, and theoretically justify a large class of models.

Submission Length: Regular submission (no more than 12 pages of main content)

Changes Since Last Submission: Camera-ready changes: de-anonymized submission, minor clarity changes, and added a slightly expanded discussion on the manifold hypothesis in the introduction.

Video: https://www.youtube.com/watch?v=-2YaUfMlwrU

Code: https://github.com/layer6ai-labs/two_step_zoo

Certifications: Expert Certification

Assigned Action Editor: ~Laurent_Dinh1

License: Creative Commons Attribution 4.0 International (CC BY 4.0)

Submission Number: 124

Loading