2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract:To understand the dynamics of training in deep neural networks, we study the evolution of the Hessian eigenvalue density throughout the optimization process. In non-batch normalized networks, we ob...