Running Forward KL,Running Update Time,Running Env Steps,Real Sto Return,Running Reverse KL,Real Det Return,Itration,Real Sto violation,Reward Loss,Real Det violation
17.9351,0,0,-172.54,11.5491,-1332.8,0,1.0,779.3175659179688,0.45
