Real Sto Return,Real Det violation,Running Forward KL,Itration,Running Env Steps,Real Det Return,Real Sto violation,Running Reverse KL,Running Update Time,Reward Loss
-203.72,0.6,18.2836,0,0,-1617.71,1.0,11.9048,0,821.6703491210938
-217.6,0.95,19.133,1,5000,-1772.66,1.0,11.9418,1,822.6505737304688
-321.38,0.2,18.3433,2,10000,-1533.55,1.0,11.3725,2,742.5924682617188
