Published: 01 Jan 2022, Last Modified: 12 May 2023UAI 2022Readers: Everyone
Abstract:Stochastic Gradient Descent (SGD) based methods have been widely used for training large-scale machine learning models that also generalize well in practice. Several explanations have been offered ...