Picking Daisies in Private: Federated Learning from Small Datasets

Michael Kamp; Jonas Fischer; Jilles Vreeken

Picking Daisies in Private: Federated Learning from Small Datasets

Michael Kamp, Jonas Fischer, Jilles Vreeken

Published: 28 Jan 2022, Last Modified: 13 Feb 2023ICLR 2022 SubmittedReaders: Everyone

Keywords: federated learning, distributed, sparse data, daisy chain, small datasets

Abstract: Federated learning allows multiple parties to collaboratively train a joint model without sharing local data. This enables applications of machine learning in settings of inherently distributed, undisclosable data such as in the medical domain. In practice, joint training is usually achieved by aggregating local models, for which local training objectives have to be in expectation similar to the joint (global) objective. Often, however, local datasets are so small that local objectives differ greatly from the global objective, resulting in federated learning to fail. We propose a novel approach that intertwines model aggregations with permutations of local models. The permutations expose each local model to a daisy chain of local datasets resulting in more efficient training in data-sparse domains. This enables training on extremely small local datasets, such as patient data across hospitals, while retaining the training efficiency and privacy benefits of federated learning.

One-sentence Summary: We propose Federated Daisy-Chaining to train a model collaboratively from extremely small, distributed datasets by intertwining model aggregations with permutations of local models.

Supplementary Material: zip

5 Replies

Loading