Convergence Rates of Non-Convex Stochastic Gradient Descent Under a Generic Lojasiewicz Condition and Local SmoothnessDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 12 May 2023ICML 2022Readers: Everyone
Abstract: Training over-parameterized neural networks involves the empirical minimization of highly non-convex objective functions. Recently, a large body of works provided theoretical evidence that, despite...
0 Replies

Loading