Iteration,Running Update Time,Reward Loss,Real Sto Return,Running Forward KL,Cost Loss,Real Sto violation,Running Env Steps,Real Det Return,Running Reverse KL,Real Det violation
0,0,42.33637619018555,1781.28,16.2604,345.01312255859375,0.0,0,1750.3,12.6883,0.0
1,1,47.843994140625,1812.4,15.6952,349.01177978515625,0.0,5000,1757.76,14.0851,0.0
