Real Det Return,Real Sto Return,Running Env Steps,Running Reverse KL,Running Update Time,Reward Loss,Iteration,Running Forward KL
-10.16,-182.2,0,11.3062,0,2082.486572265625,0,31.6748
