2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract:Gradient descent finds a global minimum in training deep neural networks despite the objective function being non-convex. The current paper proves gradient descent achieves zero training loss in po...