Empowering Neural Networks with Control and Planning Abilities

Shuyuan Wang; Philip D Loewen; Bhushan Gopaluni; Michael Forbes

Empowering Neural Networks with Control and Planning Abilities

Shuyuan Wang, Philip D Loewen, Bhushan Gopaluni, Michael Forbes

Published: 10 Oct 2024, Last Modified: 28 Oct 2024NeurIPS 2024 Workshop on Behavioral MLEveryoneRevisionsBibTeXCC BY 4.0

Keywords: RL;Imitation learning;Differentiable controller; end-to-end control; Control guided learning

Abstract: Learning effective behaviors requires both adaptability and structured planning, traditionally split between model-free and model-based methods. Differentiable control combines the strengths of both, but iLQR, a powerful nonlinear controller, lacks differentiability, limiting its use in end-to-end learning. Differentiating through extended iterations introduces scalability challenges, further hindering its application. We propose a framework that enables iLQR to function as a trainable and differentiable module, either as or within a neural network, by using implicit differentiation to compute accurate gradients with constant backward cost. On behavior imitation tasks across standard benchmarks, our method achieves up to 128x speedup (minimum 21x) over automatic differentiation and improves learning efficiency by $10^6$x compared to conventional neural policies. This framework equips neural networks with control and planning abilities, bridging control theory and behavioral learning.

Submission Number: 94

Loading