Guiding The Last Layer in Federated Learning with Pre-Trained Models

Gwen Legate; Nicolas Bernier; Lucas Caccia; Edouard Oyallon; Eugene Belilovsky

Guiding The Last Layer in Federated Learning with Pre-Trained Models

Gwen Legate, Nicolas Bernier, Lucas Caccia, Edouard Oyallon, Eugene Belilovsky

Published: 21 Sept 2023, Last Modified: 02 Nov 2023NeurIPS 2023 posterEveryoneRevisionsBibTeX

Keywords: federated learning; transfer learning; nearest mean classifier; continual learning;

TL;DR: We show in the federated learning from pre-trained model setting that a simple approach of computing a nearest mean classifier, then finetuning the model can have large advantages

Abstract: Federated Learning (FL) is an emerging paradigm that allows a model to be trained across a number of participants without sharing data. Recent works have begun to consider the effects of using pre-trained models as an initialization point for existing FL algorithms; however, these approaches ignore the vast body of efficient transfer learning literature from the centralized learning setting. Here we revisit the problem of FL from a pre-trained model considered in prior work and expand it to a set of computer vision transfer learning problems. We first observe that simply fitting a linear classification head can be efficient in many cases. We then show that in the FL setting, fitting a classifier using the Nearest Class Means (NCM) can be done exactly and orders of magnitude more efficiently than existing proposals, while obtaining strong performance. Finally, we demonstrate that using a two-stage approach of obtaining the classifier and then fine-tuning the model can yield rapid convergence and improved generalization in the federated setting. We demonstrate the potential our method has to reduce communication and compute costs while achieving better model performance.

Supplementary Material: zip

Submission Number: 9369

Loading