Real Sto Return,Running Reverse KL,Cost Loss,Running Forward KL,Itration,Real Det Return,Real Sto violation,Running Update Time,Real Det violation,Running Env Steps
-23.91,10.0242,37.79645538330078,18.0935,0,-403.59,1.0,0,0.65,0
-188.58,10.7877,67.73648834228516,18.4149,1,-1125.36,1.0,0,0.95,5000
-162.44,10.5515,49.34808349609375,18.7741,2,-1470.67,1.0,0,1.0,10000
