Running Reverse KL,Running Env Steps,Real Sto violation,Reward Loss,Real Det violation,Real Det Return,Real Sto Return,Running Update Time,Running Forward KL,Iteration
10.6191,0,0.75,-92.31995391845703,0.0,1750.34,1802.23,0,16.2606,0
