Decentralized SGD and Average-direction SAM are Asymptotically Equivalent

Published: 2023, Last Modified: 29 Sept 2023, ICML 2023, Readers: Everyone
Abstract: Decentralized stochastic gradient descent (D-SGD) allows collaborative learning on massive devices simultaneously without the control of a central server. However, existing theories claim that dece...