Comfetch: Federated Learning of Large Networks on Constrained Clients via Sketching

Tahseen Rabbani; Brandon Yushan Feng; Marco Bornstein; Yifan Yang; Kyle Rui Sang; Arjun Rajkumar; Amitabh Varshney; Furong Huang

20 Sept 2023 (modified: 25 Mar 2024), ICLR 2024 Conference Withdrawn Submission

Keywords: federated learning, compression, sketch

TL;DR: We re-parameterize large architectures using count sketches to reduce communication, computational, and memory costs for federated clients.

Abstract: Federated learning (FL) is a popular paradigm for private and collaborative model training on the edge. In centralized FL, the parameters of a global architecture (such as a deep neural network) are maintained and distributed by a central server/controller to clients, who transmit model updates (gradients) back to the server based on local optimization. While many efforts have focused on reducing the communication complexity of gradient transmission, the vast majority of compression-based algorithms assume that each participating client is able to download and train the current and full set of parameters, which may not be practical given the resource constraints of smaller clients such as mobile devices. In this work, we propose Comfetch, a simple yet effective algorithm that allows clients to train large networks using count-sketch-reduced representations of the global architecture, lowering local computational and memory costs along with bi-directional communication complexity. We provide a nonconvex convergence guarantee and experimentally demonstrate that it is possible to learn large models, such as a deep convolutional network, through federated training on their sketched counterparts. The resulting global models exhibit competitive test accuracy on CIFAR10/100 classification when compared against uncompressed model training.
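The abstract relies on the count sketch as its compression primitive. The snippet below is a minimal, generic illustration of a single-hash count sketch applied to a flat vector. It is not the authors' implementation, and it does not show how Comfetch sketches individual layers. All names (`make_count_sketch`, `sketch`, `unsketch`) and the sizes `d` and `c` are hypothetical choices made for this example. It highlights one well-known property that makes sketches attractive for aggregating client updates: the sketch is a linear map, so the sum of sketches equals the sketch of the sum.

```python
import numpy as np

def make_count_sketch(d, c, seed=0):
    """Draw a random bucket index and a random sign for each of d coordinates.

    Assumption: truly random draws stand in for the pairwise-independent
    hash functions used in the formal definition of a count sketch.
    """
    rng = np.random.default_rng(seed)
    buckets = rng.integers(0, c, size=d)                # h: [d] -> [c]
    signs = rng.choice(np.array([-1.0, 1.0]), size=d)   # s: [d] -> {-1, +1}
    return buckets, signs

def sketch(x, buckets, signs, c):
    """Compress x (length d) to length c by signed accumulation into buckets."""
    out = np.zeros(c, dtype=float)
    np.add.at(out, buckets, signs * x)  # unbuffered add handles repeated indices
    return out

def unsketch(s, buckets, signs):
    """Estimate each original coordinate from the bucket it was hashed into."""
    return signs * s[buckets]

# Hypothetical sizes: a 1000-dimensional vector compressed to 100 buckets.
d, c = 1000, 100
buckets, signs = make_count_sketch(d, c)

rng = np.random.default_rng(1)
a = rng.normal(size=d)  # stand-in for one client's update
b = rng.normal(size=d)  # stand-in for another client's update

sa = sketch(a, buckets, signs, c)
sb = sketch(b, buckets, signs, c)
s_sum = sketch(a + b, buckets, signs, c)

# A vector with a single non-zero entry is recovered exactly at that entry.
one_hot = np.zeros(d)
one_hot[7] = 3.0
recovered = unsketch(sketch(one_hot, buckets, signs, c), buckets, signs)
```

With a single hash function, the estimate of a dense vector is noisy because many coordinates collide in each bucket. Practical count sketches usually use several independent hash rows and take a median across them, which this sketch omits for brevity.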

Supplementary Material: zip

Primary Area: general machine learning (i.e., none of the above)

Submission Number: 2744
