Real Sto Return,Running Env Steps,Reward Loss,Running Update Time,Iteration,Real Det Return
-30.08,0,-11.001252174377441,0,0,-0.93
-41.72,500,-8.088377952575684,1,1,-3.15
-34.62,1000,-9.052704811096191,2,2,-2.11
-14.64,1500,0.8617662787437439,3,3,-3.11
-10.15,2000,4.440282821655273,4,4,-1.79
-9.12,2500,-6.001340866088867,5,5,-3.24
0.14,3000,3.243795871734619,6,6,-2.43
-4.38,3500,-1.8992736339569092,7,7,-2.95
-2.55,4000,-1.734887957572937,8,8,-2.54
-6.09,4500,-0.07697293907403946,9,9,-5.63
2.39,5000,2.3268954753875732,10,10,-4.01
3.75,5500,-4.261724948883057,11,11,-1.89
2.54,6000,-3.0714056491851807,12,12,13.12
-1.58,6500,-2.8925580978393555,13,13,13.16
1.19,7000,-3.3714230060577393,14,14,0.55
-0.35,7500,-0.8783912062644958,15,15,13.04
-0.67,8000,-1.8673598766326904,16,16,8.25
-2.64,8500,-1.829067587852478,17,17,1.15
-1.22,9000,-1.4131865501403809,18,18,12.45
-1.49,9500,0.6259000897407532,19,19,11.45
-1.88,10000,1.0500503778457642,20,20,12.46
-3.77,10500,-0.9324749112129211,21,21,7.31
-5.35,11000,-1.1105318069458008,22,22,8.23
