Super-Acceleration with Cyclical Step-sizes

Baptiste Goujaud; Damien Scieur; Aymeric Dieuleveut; Adrien Taylor; Fabian Pedregosa

Super-Acceleration with Cyclical Step-sizes

Baptiste Goujaud, Damien Scieur, Aymeric Dieuleveut, Adrien Taylor, Fabian Pedregosa

21 May 2021 (modified: 05 May 2023)NeurIPS 2021 SubmittedReaders: Everyone

Keywords: Heavy ball, momentum, cyclical step-sizes, acceleration, convex optimization, first-order, quadratic optimization

TL;DR: We show that step-size schedules offer a simple way to exploit spectral gaps of Hessians in machine learning

Abstract: Cyclical step-sizes are becoming increasingly popular in the optimization of deep learning problems. Motivated by recent observations on the spectral gaps of Hessians in machine learning, we show that these step-size schedules offer a simple way to exploit them. More precisely, we develop a convergence rate analysis for quadratic objectives that provides optimal parameters and shows that cyclical learning rates can improve upon traditional lower complexity bounds. We further propose a systematic approach to design optimal first order methods for quadratic minimization with a given spectral structure. Finally, we provide a local convergence rate analysis beyond quadratic minimization for the proposed methods and illustrate our findings through benchmarks on least squares and logistic regression problems.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

Supplementary Material: zip

12 Replies

Loading