Quantitative Gaussian approximation of randomly initialized deep neural networks

Andrea Basteri; Dario Trevisan

Quantitative Gaussian approximation of randomly initialized deep neural networks

Andrea Basteri, Dario Trevisan

Published: 01 Jan 2024, Last Modified: 15 May 2025Mach. Learn. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Given any deep fully connected neural network, initialized with random Gaussian parameters, we bound from above the quadratic Wasserstein distance between its output distribution and a suitable Gaussian process. Our explicit inequalities indicate how the hidden and output layers sizes affect the Gaussian behaviour of the network and quantitatively recover the distributional convergence results in the wide limit, i.e., if all the hidden layers sizes become large.

Loading