Reward Loss,Real Det Return,Running Update Time,Running Reverse KL,Real Sto Return,Running Env Steps,Itration,Running Forward KL
169216.765625,810.31,0,2102.0215,-143.44,0,0,147.2849
244763.515625,827.22,1,1782.3902,-102.1,5000,1,146.6115
