2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract:Despite its empirical success and recent theoretical progress, there generally lacks a quantitative analysis of the effect of batch normalization (BN) on the convergence and stability of gradient d...