Federated Learning of a Mixture of Global and Local Models

28 Sept 2020 (modified: 05 May 2023) · ICLR 2021 Conference Blind Submission · Readers: Everyone
Keywords: optimization, federated learning, personalization, local SGD
Abstract: We propose a new optimization formulation for training federated learning models. The standard formulation has the form of an empirical risk minimization problem constructed to find a single global model trained from the private data stored across all participating devices. In contrast, our formulation seeks an explicit trade-off between this traditional global model and the local models, which can be learned by each device from its own private data without any communication. Further, we develop several efficient variants of SGD (with and without partial participation, and with and without variance reduction) for solving the new formulation, and prove communication complexity guarantees. Notably, our methods are similar but not identical to federated averaging / local SGD, thus shedding some light on the essence of this elusive method. In particular, our methods do not perform full averaging steps and instead merely take steps towards averaging. We argue for the benefits of this new paradigm for federated learning.
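The sketch below is a minimal, illustrative reading of the abstract, not the authors' code: it assumes a proximity penalty to the average model as the global/local trade-off, and an update that, with some probability, pulls each local model part of the way towards the average rather than performing a full averaging step. The names `mixture_objective`, `l2gd_step`, `lam`, `alpha`, and `p` are placeholders introduced here for illustration.

```python
# Hedged sketch of a mixture-of-global-and-local-models formulation and a
# "step towards averaging" update, as suggested by the abstract. The exact
# penalty, step size alpha, mixing weight lam, and probability p are
# illustrative assumptions, not the paper's specification.
import numpy as np

def mixture_objective(X, losses, lam):
    """X: (n_devices, dim) array of local models; losses[i](x) -> local risk.
    lam = 0 recovers purely local training; large lam pushes all local
    models towards a single global (average) model."""
    n = X.shape[0]
    x_bar = X.mean(axis=0)
    local = sum(losses[i](X[i]) for i in range(n)) / n
    penalty = 0.5 * np.sum((X - x_bar) ** 2) / n
    return local + lam * penalty

def l2gd_step(X, grads, lam, alpha, p, rng):
    """One stochastic update: with probability 1 - p every device takes a
    local gradient step; with probability p every local model is moved a
    fraction of the way towards the current average (not fully averaged)."""
    n = X.shape[0]
    if rng.random() > p:  # local gradient step on each device
        G = np.stack([grads[i](X[i]) for i in range(n)]) / (n * (1 - p))
        return X - alpha * G
    x_bar = X.mean(axis=0, keepdims=True)
    beta = alpha * lam / (n * p)          # assumed to satisfy 0 < beta <= 1
    return (1 - beta) * X + beta * x_bar  # partial move towards the mean
```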
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
One-sentence Summary: We propose a new optimization formulation for training federated learning models, which enables local algorithms to outperform their non-local counterparts in the heterogeneous data regime.
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=rTK4BMoatt