Gaussian Process Behaviour in Wide Deep Neural Networks

Alexander G. de G. Matthews; Jiri Hron; Mark Rowland; Richard E. Turner; Zoubin Ghahramani

Gaussian Process Behaviour in Wide Deep Neural Networks

Alexander G. de G. Matthews, Jiri Hron, Mark Rowland, Richard E. Turner, Zoubin Ghahramani

15 Feb 2018 (modified: 22 Jun 2025)ICLR 2018 Conference Blind SubmissionReaders: Everyone

Abstract: Whilst deep neural networks have shown great empirical success, there is still much work to be done to understand their theoretical properties. In this paper, we study the relationship between Gaussian processes with a recursive kernel definition and random wide fully connected feedforward networks with more than one hidden layer. We exhibit limiting procedures under which finite deep networks will converge in distribution to the corresponding Gaussian process. To evaluate convergence rates empirically, we use maximum mean discrepancy. We then exhibit situations where existing Bayesian deep networks are close to Gaussian processes in terms of the key quantities of interest. Any Gaussian process has a flat representation. Since this behaviour may be undesirable in certain situations we discuss ways in which it might be prevented.

Keywords: Gaussian Processes, Bayesian Deep Learning, Theory of Deep Neural Networks

Code: [![github](/images/github_icon.svg) widedeepnetworks/widedeepnetworks](https://github.com/widedeepnetworks/widedeepnetworks) + [![Papers with Code](/images/pwc_icon.svg) 1 community implementation](https://paperswithcode.com/paper/?openreview=H1-nGgWC-)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/gaussian-process-behaviour-in-wide-deep/code)

8 Replies

Loading