Stretched Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

Published: 27 May 2026, Last Modified: 27 May 2026Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: Gradient descent and stochastic gradient descent are central to modern machine learning, yet their behavior under large step sizes remains theoretically unclear. Recent work suggests that acceleration often arises near the edge of stability, where optimization trajectories become unstable and difficult to analyze. Existing results for separable logistic regression achieve faster convergence by explicitly leveraging such unstable regimes through constant or adaptive large step sizes. In this paper, we show that instability is not inherent to acceleration. We prove that gradient descent with a simple, non-adaptive increasing step-size schedule achieves stretched exponential convergence for separable logistic regression under a margin condition, while remaining entirely within a stable optimization regime. The resulting method is anytime and does not require prior knowledge of the optimization horizon or target accuracy. We also establish stretched exponential convergence of stochastic gradient descent using a lightweight adaptive step-size rule that avoids line search and specialized procedures, improving upon existing polynomial-rate guarantees. Together, our results demonstrate that carefully structured step-size growth alone suffices to obtain stretched exponential acceleration for both gradient descent and stochastic gradient descent.
Submission Type: Regular submission (no more than 12 pages of main content)
Changes Since Last Submission: Switched to camera-ready template and added Acknowledgements.
Code: https://github.com/TFMLO-LAB-IISc/Stretched-Exponential-Rate-for-Separable-Logistic-Regression
Supplementary Material: zip
Assigned Action Editor: ~Alec_Koppel1
Submission Number: 7959
Loading