Amortized Variational Inference: When and Why?

Published: 26 Apr 2024, Last Modified: 15 Jul 2024 · UAI 2024 poster · CC BY 4.0
Keywords: variational inference; Bayesian inference; latent variable model
TL;DR: We study the scope of amortized variational inference as a general-purpose Bayesian inference algorithm and an alternative to factorized (mean-field) variational inference.
Abstract: In a probabilistic latent variable model, factorized (or mean-field) variational inference (F-VI) fits a separate parametric distribution for each latent variable. Amortized variational inference (A-VI) instead learns a common inference function, which maps each observation to its corresponding latent variable's approximate posterior. Typically, A-VI is used as a cog in the training of variational autoencoders; however, it stands to reason that A-VI could also be used as a general alternative to F-VI. In this paper, we study when and why A-VI can be used for approximate Bayesian inference. We derive necessary, sufficient, and verifiable conditions on a latent variable model under which A-VI can attain F-VI's optimal solution, thereby closing the amortization gap. We prove these conditions are uniquely verified by simple hierarchical models, a broad class that encompasses many models in machine learning. We then show, on a broader class of models, how to expand the domain of A-VI's inference function to improve its solution, and we provide examples, e.g., hidden Markov models, where the amortization gap cannot be closed.
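For intuition, here is a minimal sketch (editor's illustration, not the authors' code) contrasting the two parameterizations on an assumed toy model z_n ~ N(0, 1), x_n | z_n ~ N(z_n, 1): F-VI keeps a separate (mu_n, log_sigma_n) pair per latent variable, while A-VI replaces them with a single shared inference network applied to each observation. All names (`elbo`, `avi_net`, etc.) are hypothetical.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
N = 200
# Synthetic data from the assumed model: x_n = z_n + noise.
x = torch.randn(N, 1) + torch.randn(N, 1)

def elbo(x, mu, log_sigma):
    """Single-sample reparameterized ELBO (constants dropped)."""
    sigma = log_sigma.exp()
    z = mu + sigma * torch.randn_like(mu)        # z ~ q(z | mu, sigma)
    log_p_x = -0.5 * (x - z).pow(2)              # log N(x | z, 1) + const
    log_p_z = -0.5 * z.pow(2)                    # log N(z | 0, 1) + const
    log_q_z = -0.5 * ((z - mu) / sigma).pow(2) - log_sigma
    return (log_p_x + log_p_z - log_q_z).sum()

# F-VI: one variational parameter pair per latent variable.
fvi_mu = nn.Parameter(torch.zeros(N, 1))
fvi_log_sigma = nn.Parameter(torch.zeros(N, 1))

# A-VI: one inference function shared across all observations.
avi_net = nn.Sequential(nn.Linear(1, 16), nn.Tanh(), nn.Linear(16, 2))

opt = torch.optim.Adam([fvi_mu, fvi_log_sigma, *avi_net.parameters()], lr=1e-2)
for step in range(2000):
    opt.zero_grad()
    loss_f = -elbo(x, fvi_mu, fvi_log_sigma)
    mu_a, log_sigma_a = avi_net(x).chunk(2, dim=-1)
    loss_a = -elbo(x, mu_a, log_sigma_a)
    (loss_f + loss_a).backward()
    opt.step()

# The amortization gap is the difference between the two optimized ELBOs.
print("F-VI ELBO:", -loss_f.item(), " A-VI ELBO:", -loss_a.item())
```

In this sketch, F-VI's parameter count grows with N while A-VI's stays fixed; the paper's question is when the A-VI objective can nonetheless reach F-VI's optimum.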
List Of Authors: Margossian, Charles C and Blei, David M
Submission Number: 443