Abstract: We study a model-free federated linear quadratic regulator (LQR) problem where M agents with unknown, distinct yet similar dynamics collaboratively learn an optimal policy to minimize an average quadratic cost while keeping their data private. To exploit the similarity of the agents' dynamics, we propose to use federated learning (FL) to allow the agents to periodically communicate with a central server to train policies by leveraging a larger dataset from all the agents. With this setup, we seek to understand the following questions: (i) Is the learned common policy stabilizing for all agents? (ii) How close is the learned common policy to each agent's own optimal policy? (iii) Can each agent learn its own optimal policy faster by leveraging data from all agents? To answer these questions, we propose the federated and model-free algorithm FedLQR. Our analysis overcomes numerous technical challenges, such as heterogeneity in the agents' dynamics, multiple local updates, and stability concerns. We show that FedLQR produces a common policy that, at each iteration, is stabilizing for all agents. Moreover, we prove that when learning each agent's optimal policy, FedLQR achieves a sample complexity reduction proportional to the number of agents M in a low-heterogeneity regime, compared to the single-agent setting.
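The abstract describes the setup only at a high level; as a rough illustration of what such a federated, model-free policy-gradient loop might look like, the sketch below has each agent run local zeroth-order (cost-only) gradient steps on its own LQR cost, with a server periodically averaging the local policy gains. All names, constants, and the specific gradient estimator here are illustrative assumptions and are not taken from the paper's pseudocode.

```python
import numpy as np

# Hypothetical sketch of a federated, model-free LQR loop (assumptions, not the
# paper's exact algorithm): M agents with similar but distinct dynamics (A_i, B_i)
# each perform a few local zeroth-order policy-gradient updates, and a central
# server averages the resulting policies into a common gain K.

rng = np.random.default_rng(0)

n, m = 2, 1          # state / input dimensions
M = 5                # number of agents
local_steps = 5      # local updates between communication rounds
rounds = 20          # communication rounds
eta = 1e-3           # local step size
r = 0.05             # smoothing radius for the zeroth-order gradient estimate
horizon = 50         # rollout length approximating the infinite-horizon cost

# Similar yet distinct dynamics: small perturbations around a nominal (A0, B0).
A0 = np.array([[0.9, 0.2], [0.0, 0.8]])
B0 = np.array([[0.0], [1.0]])
agents = [(A0 + 0.01 * rng.standard_normal((n, n)),
           B0 + 0.01 * rng.standard_normal((n, m))) for _ in range(M)]
Q, R = np.eye(n), np.eye(m)

def rollout_cost(K, A, B):
    """Finite-horizon approximation of the quadratic cost under u = -K x."""
    x = np.ones(n)
    cost = 0.0
    for _ in range(horizon):
        u = -K @ x
        cost += x @ Q @ x + u @ R @ u
        x = A @ x + B @ u
    return cost

def zo_gradient(K, A, B):
    """Two-point zeroth-order gradient estimate: uses only observed costs."""
    U = rng.standard_normal(K.shape)
    U /= np.linalg.norm(U)
    delta = rollout_cost(K + r * U, A, B) - rollout_cost(K - r * U, A, B)
    return (K.size * delta / (2 * r)) * U

# Federated loop: local model-free updates, then server-side policy averaging.
K = np.zeros((m, n))                      # common policy held by the server
for _ in range(rounds):
    local_policies = []
    for A, B in agents:
        K_i = K.copy()
        for _ in range(local_steps):      # multiple local updates per round
            K_i -= eta * zo_gradient(K_i, A, B)
        local_policies.append(K_i)
    K = np.mean(local_policies, axis=0)   # server averages the agents' policies

print("learned common gain K:\n", K)
```

In this toy version, only the policy gains (not raw trajectories) are communicated, which loosely mirrors the privacy and data-leveraging motivation described above; the paper's actual update rule, step sizes, and stability safeguards may differ.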
Submission Length: Long submission (more than 12 pages of main content)
Assigned Action Editor: ~Dileep_Kalathil1
Submission Number: 5533