Reward Loss,Running Update Time,Itration,Real Sto violation,Real Sto Return,Real Det violation,Real Det Return,Running Reverse KL,Running Env Steps,Running Forward KL
-137.95973205566406,0,0,0.9,1761.14,0.0,1749.72,10.1354,0,17.1261
