PAGAR Loss,Running Env Steps,Real Sto Return,Reward Loss,Running Forward KL,Running Reverse KL,Real Det Return,Running Update Time,Iteration
nan,0,-55.75,10541.4404296875,146.2156,2008.6725,937.67,0,0
