PAGAR Loss,Running Reverse KL,Running Env Steps,Itration,Real Det Return,Reward Loss,Running Forward KL,Real Sto Return,Running Update Time
90.99569239843349,11.4686,0,0,-25.29,2743699.5,31.7676,-155.79,0
54.038784755576486,11.9275,5000,1,1.0,2895614.0,31.9314,-110.89,1
-35.47089560100656,12.8463,10000,2,2.6,2873008.25,32.3887,-83.63,2
