Real Sto Return,Running Env Steps,Real Det violation,Real Det Return,Real Sto violation,Running Reverse KL,Reward Loss,Running Forward KL,Running Update Time,Itration
91.94,0,0.0,-109.25,1.0,8.7211,92.15702819824219,15.9096,0,0
-19.72,5000,0.0,335.12,1.0,8.6517,97.03089141845703,15.6895,1,1
2.7,10000,0.0,269.7,1.0,9.0715,121.91548919677734,16.1517,2,2
44.73,15000,0.0,277.44,1.0,8.9065,67.7824478149414,15.8299,3,3
