Natural Policy Gradient Preserves Spatial Decay Properties for Control of Networked Dynamical Systems

Eric Xu, Guannan Qu

Published: 19 Jan 2024, Last Modified: 28 Jan 2026 | CDC | Everyone | CC BY 4.0

Abstract: We consider the distributed control of networked linear time-invariant systems. Previous work has established the spatial decay property of the centralized controller, which allows the centralized controller to be truncated to a k-hop distributed controller with small performance loss. This paper goes a step further by showing that a policy optimization approach, Natural Policy Gradient (NPG), preserves the spatial decay property of controllers. This enables “truncating” Natural Policy Gradient to directly learn a k-hop distributed controller.
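
To make the k-hop truncation mentioned in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes scalar subsystems (one state and one input per node), an unweighted communication graph given as adjacency lists, and a hypothetical gain matrix whose entries decay geometrically with hop distance. The names `hop_distances`, `truncate_k_hop`, `rho`, and the 5-node path graph are illustrative assumptions, not taken from the paper.

```python
from collections import deque


def hop_distances(adjacency):
    """All-pairs hop distances on an unweighted graph, computed by BFS.

    adjacency[i] lists the neighbors of node i. Unreachable pairs stay None.
    """
    n = len(adjacency)
    dist = [[None] * n for _ in range(n)]
    for src in range(n):
        dist[src][src] = 0
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adjacency[u]:
                if dist[src][v] is None:
                    dist[src][v] = dist[src][u] + 1
                    queue.append(v)
    return dist


def truncate_k_hop(K, adjacency, k):
    """Zero every gain K[i][j] whose nodes are more than k hops apart."""
    dist = hop_distances(adjacency)
    n = len(K)
    return [
        [
            K[i][j] if dist[i][j] is not None and dist[i][j] <= k else 0.0
            for j in range(n)
        ]
        for i in range(n)
    ]


# Hypothetical example: a 5-node path graph 0 - 1 - 2 - 3 - 4.
adjacency = [[1], [0, 2], [1, 3], [2, 4], [3]]

# Assumed spatially decaying gain: |K[i][j]| = rho ** (hop distance).
rho = 0.5
K = [[rho ** abs(i - j) for j in range(5)] for i in range(5)]

# Keep only gains between nodes at most 1 hop apart.
K_1hop = truncate_k_hop(K, adjacency, 1)
```

In this toy setting every discarded entry has magnitude at most `rho ** 2`, which illustrates why spatial decay keeps the truncation error small: the entries removed are exactly the ones that are already small.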