Running Reverse KL,Running Forward KL,Real Det Return,Running Env Steps,Cost Loss,Real Sto violation,Real Sto Return,Iteration,Running Update Time,Real Det violation
14.262,19.1912,-1735.79,0,12.270651817321777,1.0,-379.54,0,0,1.0
15.2408,20.7423,-934.48,5000,-35.349884033203125,1.0,-510.74,1,0,1.0
15.7651,20.9089,-1041.65,10000,-74.56341552734375,1.0,-342.08,2,0,1.0
14.7751,20.3028,-1302.37,15000,-78.46495819091797,1.0,-451.09,3,0,1.0
15.5708,20.6984,-1606.93,20000,-115.8636245727539,1.0,-491.17,4,0,1.0
16.2214,20.7014,-1567.55,25000,-152.93431091308594,1.0,-623.99,5,0,0.95
16.7492,21.5001,-1785.73,30000,-181.09359741210938,1.0,-622.2,6,0,0.95
