
 ---------------------------- median -----------------------------
                 exp       cr  episode_len   kl_div      mse    range  succeed
model                                                                         
a2c_new         51.0  0.48965     18.00000  4.60779  0.37241  0.41891      1.0
lt_a2c_v2_pp10  49.0  0.94761     18.02022  0.00754  0.00038  0.45563      1.0
lt_a2c_v2_pp20  49.5  0.94703     18.02163  0.00918  0.00037  0.45090      1.0
lt_a2c_v2_pp40  49.0  0.94681     18.02092  0.00918  0.00031  0.44831      1.0
lt_ppo_v2_pp10  49.5  0.94662     18.02308  0.00910  0.00040  0.45868      1.0
lt_ppo_v2_pp20  49.5  0.94610     18.02164  0.00723  0.00036  0.45265      1.0
lt_ppo_v2_pp40  49.0  0.94607     18.02163  0.00743  0.00033  0.44856      1.0
ppo_new         50.0  0.47595     18.00000  4.79788  0.44824  0.41757      1.0
 ----------------------------  average ----------------------------
                     exp       cr  episode_len   kl_div      mse    range  succeed
model                                                                             
a2c_new         50.74227  0.52205     18.00015  4.55276  0.53992  0.42509      1.0
lt_a2c_v2_pp10  49.12121  0.94676     18.02119  0.01526  0.00040  0.45677      1.0
lt_a2c_v2_pp20  49.50000  0.94665     18.02191  0.01680  0.00040  0.45184      1.0
lt_a2c_v2_pp40  49.05051  0.94743     18.02399  0.01639  0.00036  0.45072      1.0
lt_ppo_v2_pp10  49.50000  0.94653     18.02303  0.01491  0.00042  0.45855      1.0
lt_ppo_v2_pp20  49.50000  0.94649     18.02239  0.01447  0.00039  0.45225      1.0
lt_ppo_v2_pp40  49.26263  0.94657     18.02185  0.01703  0.00034  0.44907      1.0
ppo_new         49.90722  0.51324     18.00019  4.65657  0.55031  0.42352      1.0
 -----------------------  standard deviation -----------------------
                     exp       cr  episode_len   kl_div      mse    range  succeed
model                                                                             
a2c_new         28.53553  0.12461      0.00051  0.96196  0.49962  0.05195      0.0
lt_a2c_v2_pp10  28.90953  0.00401      0.00423  0.02122  0.00014  0.00987      0.0
lt_a2c_v2_pp20  29.01149  0.00488      0.00532  0.01921  0.00014  0.00974      0.0
lt_a2c_v2_pp40  28.80702  0.00461      0.02151  0.01762  0.00020  0.01355      0.0
lt_ppo_v2_pp10  29.01149  0.00411      0.00487  0.01574  0.00014  0.00876      0.0
lt_ppo_v2_pp20  29.01149  0.00438      0.00565  0.01816  0.00013  0.00969      0.0
lt_ppo_v2_pp40  29.06137  0.00442      0.00475  0.02159  0.00011  0.00859      0.0
ppo_new         29.06354  0.11215      0.00052  0.98201  0.47658  0.04830      0.0
 ----------------------------  trimmed mean ----------------------------
                      cr       mse     range    kl_div
model                                                 
a2c_new         0.489289  0.535269  0.413356  4.646551
lt_a2c_v2_pp10  0.946985  0.000379  0.457066  0.010002
lt_a2c_v2_pp20  0.946741  0.000388  0.451844  0.013884
lt_a2c_v2_pp40  0.947481  0.000334  0.449475  0.013697
lt_ppo_v2_pp10  0.946708  0.000405  0.458284  0.011393
lt_ppo_v2_pp20  0.946690  0.000378  0.452091  0.009422
lt_ppo_v2_pp40  0.946473  0.000321  0.448589  0.010816
ppo_new         0.486634  0.561150  0.415594  4.773372
 ----------------------------  trimmed std ----------------------------
                      cr       mse     range    kl_div
model                                                 
a2c_new         0.060650  0.397434  0.022589  0.728775
lt_a2c_v2_pp10  0.003824  0.000107  0.008879  0.010447
lt_a2c_v2_pp20  0.004489  0.000122  0.009536  0.014449
lt_a2c_v2_pp40  0.003971  0.000111  0.009394  0.012567
lt_ppo_v2_pp10  0.004042  0.000121  0.008908  0.009867
lt_ppo_v2_pp20  0.004515  0.000119  0.009068  0.009408
lt_ppo_v2_pp40  0.004188  0.000076  0.007656  0.010873
ppo_new         0.065986  0.427208  0.024050  0.892590