Running Update Time,Reward Loss,Real Det Return,Running Reverse KL,Running Forward KL,PAGAR Loss,Running Env Steps,Real Sto Return,Itration
0,-119322.109375,845.66,2061.6047,149.7745,nan,0,-151.65,0
