Running Reverse KL,Running Update Time,Real Sto violation,Real Det Return,Itration,Running Forward KL,Running Env Steps,Real Det violation,Reward Loss,Real Sto Return
8.4876,0,1.0,135.65,0,15.8389,0,0.0,0.24563074111938477,207.79
8.882,1,1.0,134.76,1,16.0749,5000,0.0,6.0166120529174805,186.5
