2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone
Abstract:We consider deep linear networks with arbitrary convex differentiable loss. We provide a short and elementary proof of the fact that all local minima are global minima if the hidden layers are eith...