Collapsed Variational Bounds for Bayesian Neural Networks

Marcin B. Tomczak; Siddharth Swaroop; Andrew Y. K. Foong; Richard E Turner

Collapsed Variational Bounds for Bayesian Neural Networks

Marcin B. Tomczak, Siddharth Swaroop, Andrew Y. K. Foong, Richard E Turner

Published: 09 Nov 2021, Last Modified: 05 May 2023NeurIPS 2021 PosterReaders: Everyone

Keywords: Bayesian Neural Networks, BNNs, Bayesian Deep Learning, variational inference, VI, underfitting, collapsed ELBO, overpruning

TL;DR: We derive a family of collapsed, tighter ELBOs to learn variational posteriors over weights in Bayesian Neural Networks.

Abstract: Recent interest in learning large variational Bayesian Neural Networks (BNNs) has been partly hampered by poor predictive performance caused by underfitting, and their performance is known to be very sensitive to the prior over weights. Current practice often fixes the prior parameters to standard values or tunes them using heuristics or cross-validation. In this paper, we treat prior parameters in a distributional way by extending the model and collapsing the variational bound with respect to their posteriors. This leads to novel and tighter Evidence Lower Bounds (ELBOs) for performing variational inference (VI) in BNNs. Our experiments show that the new bounds significantly improve the performance of Gaussian mean-field VI applied to BNNs on a variety of data sets, demonstrating that mean-field VI works well even in deep models. We also find that the tighter ELBOs can be good optimization targets for learning the hyperparameters of hierarchical priors.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

Supplementary Material: pdf

Code: https://github.com/marctom/collapsed_bnns

12 Replies

Loading