Published: 2019, Last Modified: 29 Apr 2023COLT 2019Readers: Everyone
Abstract:Folklore results in the theory of Stochastic Approximation indicates the (minimax) optimality of Stochastic Gradient Descent (SGD) (Robbins and Monro, 1951) with polynomially decaying stepsizes and...