Abstract: Highlights
• An augmented Lagrangian-based method for stable and safe RL with efficient training.
• An extension with learned barrier certificates, supported by theoretical guarantees.
• Investigation into the infeasibility due to the presence of multiple constraints.
• An algorithm using neural ODEs for improved modeling performance with public code.