Benefits of Overparameterization in Single-Layer Latent Variable Generative Models

Rares-Darius Buhai; Andrej Risteski; Yoni Halpern; David Sontag

Benefits of Overparameterization in Single-Layer Latent Variable Generative Models

Rares-Darius Buhai, Andrej Risteski, Yoni Halpern, David Sontag

25 Sept 2019 (modified: 22 Jun 2025)ICLR 2020 Conference Blind SubmissionReaders: Everyone

TL;DR: Overparameterization aids parameter recovery in unsupervised settings.

Abstract: One of the most surprising and exciting discoveries in supervising learning was the benefit of overparameterization (i.e. training a very large model) to improving the optimization landscape of a problem, with minimal effect on statistical performance (i.e. generalization). In contrast, unsupervised settings have been under-explored, despite the fact that it has been observed that overparameterization can be helpful as early as Dasgupta & Schulman (2007). In this paper, we perform an exhaustive study of different aspects of overparameterization in unsupervised learning via synthetic and semi-synthetic experiments. We discuss benefits to different metrics of success (recovering the parameters of the ground-truth model, held-out log-likelihood), sensitivity to variations of the training algorithm, and behavior as the amount of overparameterization increases. We find that, when learning using methods such as variational inference, larger models can significantly increase the number of ground truth latent variables recovered.

Code: https://drive.google.com/file/d/1bKia5vceblhQuggssScteiUR_QfoVzmu/view?usp=sharing

Keywords: overparameterization, unsupervised, parameter recovery, rigorous experiments

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/benefits-of-overparameterization-in-single/code)

Original Pdf: pdf

7 Replies

Loading