FedPartial: Enabling Model-Heterogeneous Federated Learning via Partial Model Transmission and Aggregation

Published: 2024, Last Modified: 13 May 2025ICWS 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Federated learning (FL) is emerging as a new privacy-preserving learning paradigm that allows multiple devices to collaborate in training a model without sharing their raw data, using a central server for coordination. However, device heterogeneity poses a challenge in FL, as participating devices often have different computing capacities. To address this issue, heterogeneous models need to be designed to accommodate different device computing capacities. The existing approach involves pre-designing multiple heterogeneous models and extracting sub-models from the server model. While such an approach effectively tackles device heterogeneity, it has several drawbacks such as high communication overhead and insufficient personalization. In each training round, the server distributes the entire model parameters to each device, and each device also transmits the entire model parameters to the server for aggregation. In this work, we propose FedPartial, a new framework that overcomes these challenges by introducing a partial model transmission and aggregation mechanism. The FedPartial framework eliminates the need for devices to transmit the entire model parameters in each training round while still benefiting from global model aggregation. Specifically, FedPartial divides the device model into two parts: the shallow part participates in the global aggregation of heterogeneous models, while the deep part remains on the device locally. By keeping the deep part of the model on the device, FedPartial reduces the communication overhead significantly and achieves a certain degree of personalization. Through extensive experiments, we demonstrate that FedPartial outperforms existing state-of-the-art methods, particularly in more complex and statistically heterogeneous scenarios.
Loading