
Mean and Std for totalReward:
HyperRNN: Mean = 11.7438, Std = 8.8796
RNNEXP: Mean = 9.8419, Std = 9.2187
RNNIMP: Mean = 0.1112, Std = 11.7164

Tukey HSD results for totalReward:
  Multiple Comparison of Means - Tukey HSD, FWER=0.05  
=======================================================
 group1  group2 meandiff p-adj  lower    upper   reject
-------------------------------------------------------
HyperRNN RNNEXP  -1.9019   0.0  -2.7599   -1.044   True
HyperRNN RNNIMP -11.6326   0.0 -12.4905 -10.7747   True
  RNNEXP RNNIMP  -9.7307   0.0 -10.5886  -8.8727   True
-------------------------------------------------------

Pairwise p-values:
[6.32001067e-07 9.19375687e-13 9.19375687e-13]

Mean and Std for totalSteps:
HyperRNN: Mean = 54.7680, Std = 14.3352
RNNEXP: Mean = 57.7733, Std = 13.6862
RNNIMP: Mean = 65.7313, Std = 11.1561

Tukey HSD results for totalSteps:
Multiple Comparison of Means - Tukey HSD, FWER=0.05 
====================================================
 group1  group2 meandiff p-adj lower   upper  reject
----------------------------------------------------
HyperRNN RNNEXP   3.0053   0.0 1.8808  4.1298   True
HyperRNN RNNIMP  10.9633   0.0 9.8388 12.0878   True
  RNNEXP RNNIMP    7.958   0.0 6.8335  9.0825   True
----------------------------------------------------

Pairwise p-values:
[1.21705201e-09 9.19375687e-13 9.19375687e-13]

Mean and Std for totalCollisions:
HyperRNN: Mean = 0.0147, Std = 0.1202
RNNEXP: Mean = 0.0147, Std = 0.1202
RNNIMP: Mean = 0.0313, Std = 0.1742

Tukey HSD results for totalCollisions:
Multiple Comparison of Means - Tukey HSD, FWER=0.05 
====================================================
 group1  group2 meandiff p-adj  lower  upper  reject
----------------------------------------------------
HyperRNN RNNEXP      0.0    1.0 -0.012  0.012  False
HyperRNN RNNIMP   0.0167 0.0034 0.0046 0.0287   True
  RNNEXP RNNIMP   0.0167 0.0034 0.0046 0.0287   True
----------------------------------------------------

Pairwise p-values:
[1.         0.00337108 0.00337108]

Mean and Std for totalBoundary:
HyperRNN: Mean = 0.0067, Std = 0.0814
RNNEXP: Mean = 0.0067, Std = 0.0814
RNNIMP: Mean = 0.0160, Std = 0.1255

Tukey HSD results for totalBoundary:
 Multiple Comparison of Means - Tukey HSD, FWER=0.05 
=====================================================
 group1  group2 meandiff p-adj   lower  upper  reject
-----------------------------------------------------
HyperRNN RNNEXP      0.0    1.0 -0.0084 0.0084  False
HyperRNN RNNIMP   0.0093 0.0254  0.0009 0.0178   True
  RNNEXP RNNIMP   0.0093 0.0254  0.0009 0.0178   True
-----------------------------------------------------

Pairwise p-values:
[1.         0.02538227 0.02538227]
