Convergence Rates of Non-Convex Stochastic Gradient Descent Under a Generic Lojasiewicz Condition and Local Smoothness

Kevin Scaman, Cédric Malherbe, Ludovic Dos Santos

Published: 2022, Last Modified: 12 May 2023ICML 2022Readers: Everyone

Abstract: Training over-parameterized neural networks involves the empirical minimization of highly non-convex objective functions. Recently, a large body of works provided theoretical evidence that, despite...

0 Replies