Real Det violation,Running Reverse KL,Real Sto Return,Real Sto violation,Itration,Reward Loss,Running Update Time,Running Forward KL,Real Det Return,Running Env Steps
0.85,11.2437,-217.68,1.0,0,774.2619018554688,0,18.283,-1672.24,0
