Real Det violation,Running Env Steps,Running Forward KL,Real Sto violation,Running Update Time,Real Det Return,Real Sto Return,Reward Loss,Iteration,Running Reverse KL
0.0,0,16.6466,0.95,0,1749.61,1776.54,-140.88980102539062,0,10.4077
