
Mean and Std for totalReward:
HyperRNN: Mean = 14.8359, Std = 7.9094
RNNEXP: Mean = 12.9380, Std = 8.2125
RNNIMP: Mean = 2.3184, Std = 11.8641

Tukey HSD results for totalReward:
  Multiple Comparison of Means - Tukey HSD, FWER=0.05  
=======================================================
 group1  group2 meandiff p-adj  lower    upper   reject
-------------------------------------------------------
HyperRNN RNNEXP  -1.8979   0.0  -2.7114  -1.0843   True
HyperRNN RNNIMP -12.5175   0.0  -13.331 -11.7039   True
  RNNEXP RNNIMP -10.6196   0.0 -11.4332   -9.806   True
-------------------------------------------------------

Pairwise p-values:
[1.42837163e-07 9.19375687e-13 9.19375687e-13]

Mean and Std for totalSteps:
HyperRNN: Mean = 49.7853, Std = 12.8742
RNNEXP: Mean = 53.5873, Std = 13.1039
RNNIMP: Mean = 63.8493, Std = 12.4133

Tukey HSD results for totalSteps:
 Multiple Comparison of Means - Tukey HSD, FWER=0.05 
=====================================================
 group1  group2 meandiff p-adj  lower   upper  reject
-----------------------------------------------------
HyperRNN RNNEXP    3.802   0.0  2.7058  4.8982   True
HyperRNN RNNIMP   14.064   0.0 12.9678 15.1602   True
  RNNEXP RNNIMP   10.262   0.0  9.1658 11.3582   True
-----------------------------------------------------

Pairwise p-values:
[9.21041021e-13 9.19375687e-13 9.19375687e-13]

Mean and Std for totalCollisions:
HyperRNN: Mean = 0.0133, Std = 0.1147
RNNEXP: Mean = 0.0160, Std = 0.1255
RNNIMP: Mean = 0.0280, Std = 0.1650

Tukey HSD results for totalCollisions:
Multiple Comparison of Means - Tukey HSD, FWER=0.05 
====================================================
 group1  group2 meandiff p-adj  lower  upper  reject
----------------------------------------------------
HyperRNN RNNEXP   0.0027 0.8547 -0.009 0.0144  False
HyperRNN RNNIMP   0.0147 0.0094  0.003 0.0264   True
  RNNEXP RNNIMP    0.012 0.0432 0.0003 0.0237   True
----------------------------------------------------

Pairwise p-values:
[0.85470954 0.00936851 0.04315071]

Mean and Std for totalBoundary:
HyperRNN: Mean = 0.0020, Std = 0.0447
RNNEXP: Mean = 0.0060, Std = 0.0772
RNNIMP: Mean = 0.0127, Std = 0.1118

Tukey HSD results for totalBoundary:
 Multiple Comparison of Means - Tukey HSD, FWER=0.05 
=====================================================
 group1  group2 meandiff p-adj   lower  upper  reject
-----------------------------------------------------
HyperRNN RNNEXP    0.004 0.3808 -0.0031 0.0111  False
HyperRNN RNNIMP   0.0107 0.0012  0.0036 0.0177   True
  RNNEXP RNNIMP   0.0067 0.0696 -0.0004 0.0137  False
-----------------------------------------------------

Pairwise p-values:
[0.38082679 0.00119555 0.06963791]
