Reward Loss,Iteration,Running Env Steps,Running Reverse KL,Running Forward KL,Real Sto Return,Real Det violation,Real Det Return,Real Sto violation,Running Update Time
-117.39988708496094,0,0,10.3242,16.932,1772.33,0.0,1750.48,1.0,0
