Real Det violation,Reward Loss,Real Sto Return,Real Det Return,Running Update Time,Real Sto violation,Running Reverse KL,Itration,Running Forward KL,Running Env Steps
0.0,53.13185501098633,70.48,281.66,0,1.0,8.9582,0,16.2562,0
0.0,82.26153564453125,61.38,616.24,1,1.0,8.4773,1,16.1735,5000
