Infeasible optimization problems and the hierarchical augmented Lagrangian method in imitation learning
Keywords: Augmented Lagrangian Method, Imitation learning, safety
Abstract: Imitation learning (IL) is an effective approach to train complex robotics policies.
Recent works have introduced hard constraints into imitation-learning optimization problems to ensure safety, stability, and robustness of the learned policy.
However, we argue that these constraints are sometimes infeasible, which can lead to unstable or difficult training dynamics.
We study a simple remedy for such situations based on recent theoretical results on the augmented Lagrangian method in infeasible settings.
We show that our approach drives the learned policy toward the solution of a closest-feasible constrained IL problem with desirable properties.
The method is illustrated on a toy driving example with a total-acceleration constraint and pedestrian-safety constraints, a setting in which infeasibility can naturally arise while still allowing a safe learned policy.
Submission Number: 53
Loading