Running Reverse KL,Running Update Time,Running Env Steps,Real Sto violation,Reward Loss,Running Forward KL,Real Det violation,Iteration,Real Sto Return,Real Det Return
13.2085,0,0,1.0,895.8283081054688,18.8032,0.05,0,-412.68,-1673.96
13.4776,1,5000,1.0,860.6958618164062,19.2146,0.05,1,-386.8,-1563.93
13.1328,2,10000,1.0,790.1640625,19.3282,0.0,2,-381.7,-1663.65
13.9681,3,15000,1.0,808.2021484375,19.4961,0.2,3,-365.03,-1675.12
