Synthetic experiments (Gridworld, CartPole) and HighD experiments were performed using `code_ppo_pen`.
Mujoco experiments (Ant-Constrained, HalfCheetah-Constrained) and ExiD experiments were performed using `code_ppo_lag`.
