FRACTAL: Data-Aware Clustering and Communication Optimization for Decentralized Federated Learning

Qianpiao Ma, Jianchun Liu, Hongli Xu, Qingmin Jia, Renchao Xie

Published: 01 Oct 2025, Last Modified: 08 Feb 2026, IEEE Transactions on Big Data, CC BY-SA 4.0
Abstract: Decentralized federated learning (DFL) is a promising technique for enabling distributed machine learning over edge nodes without relying on a centralized parameter server. However, existing DFL network topologies, such as fully connected, partially connected, or low-tier hierarchical topologies, often struggle to address the unique challenges of edge networks, including edge heterogeneity, communication resource constraints, and non-IID data. To tackle these challenges, we propose a data-aware clustering algorithm, called FRACTAL, which constructs a multi-tier hierarchical topology in a bottom-up manner, taking both data distribution and communication efficiency into account for DFL. We theoretically explore the quantitative relationship between the convergence bound of multi-tier FL and the data distribution among the servers at each tier. To further improve communication efficiency and address edge heterogeneity, we deploy a time-sharing communication scheduling algorithm within each fractal unit (the basic structure in FRACTAL, consisting of multiple nodes and an aggregator), called the magic mirror method (MMM), to determine the optimal order of model distribution and uploading for nodes. We conduct extensive experiments on classical models and datasets to evaluate the performance of FRACTAL, and the results show that FRACTAL accelerates DFL model training by 48.6%–72.3% compared with state-of-the-art solutions.
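The paper's actual clustering algorithm is not reproduced in the abstract; as a rough illustration of what "data-aware" grouping can mean under non-IID data, the sketch below greedily assigns nodes to clusters so that each cluster's aggregate label distribution stays close to the global distribution. The function names, the KL-divergence criterion, and the greedy order are our assumptions for illustration only, not FRACTAL's method.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two label-count vectors, normalized to distributions."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p, q = p / p.sum(), q / q.sum()
    return float(np.sum(p * np.log(p / q)))

def greedy_data_aware_clusters(label_counts, num_clusters):
    """Hypothetical sketch: assign nodes to clusters so each cluster's
    aggregate label distribution approximates the global one."""
    global_dist = label_counts.sum(axis=0)
    clusters = [[] for _ in range(num_clusters)]
    cluster_counts = [np.zeros_like(global_dist) for _ in range(num_clusters)]
    # Process the most skewed nodes first, so skewed nodes can be paired
    # with clusters whose aggregate moves toward the global mix.
    order = sorted(range(len(label_counts)),
                   key=lambda i: -kl_divergence(label_counts[i], global_dist))
    for i in order:
        best = min(range(num_clusters),
                   key=lambda c: kl_divergence(
                       cluster_counts[c] + label_counts[i], global_dist))
        clusters[best].append(i)
        cluster_counts[best] += label_counts[i]
    return clusters

# Example: 6 nodes, 3 labels, highly skewed (non-IID) local data.
counts = np.array([[90, 5, 5], [5, 90, 5], [5, 5, 90],
                   [80, 10, 10], [10, 80, 10], [10, 10, 80]])
print(greedy_data_aware_clusters(counts, 2))
```

In this toy setting, each of the two clusters receives nodes covering all three labels, so the per-cluster aggregate data is closer to IID than any single node's local data, which is the intuition the abstract attributes to data-aware bottom-up clustering.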