A Single Global Merging Suffices: Recovering Centralized Learning Performance in Decentralized Learning
Track: long paper (up to 8 pages)
Keywords: Model Merging, Decentralized Learning
TL;DR: We discover and theoretically reveal why and when a single global parameter merging at the end of decentralized training can recover the performance of centralized training, even under heterogeneous and communication-constrained settings.
Submission Number: 5
Loading