Best hyperparameters for reward shaping experiments

ENV MODE STEP_SIZE R_AUX_WEIGHT
dayan none 0.3 0.
dayan SR_wang 0.3 0.25
dayan SR_potential 1.0 0.75
dayan DR_potential 1.0 0.75
dayan_2 none 0.3 0.
dayan_2 SR_wang 0.3 0.25
dayan_2 SR_potential 0.3 0.5
dayan_2 DR_potential 1.0 0.75
fourrooms none 0.3 0.
fourrooms SR_wang 0.3 0.25
fourrooms SR_potential 0.3 0.5
fourrooms DR_potential 1.0 0.75
fourrooms_2 none 0.3 0.
fourrooms_2 SR_wang 1.0 0.75
fourrooms_2 SR_potential 1.0 0.25
fourrooms_2 DR_potential 1.0 0.75
gridroom none 0.3 0.
gridroom SR_wang 0.3 0.25
gridroom SR_potential 1.0 0.25
gridroom DR_potential 1.0 0.5
gridroom_2 none 1.0 0.
gridroom_2 SR_wang 1.0 0.25
gridroom_2 SR_potential 1.0 0.25
gridroom_2 DR_potential 1.0 0.5
gridmaze none 1.0 0.
gridmaze SR_wang 0.3 0.25
gridmaze SR_potential 1.0 0.25
gridmaze DR_potential 1.0 0.5
gridmaze_2 none 1.0 0.
gridmaze_2 SR_wang 1.0 0.5
gridmaze_2 SR_potential 1.0 0.25
gridmaze_2 DR_potential 1.0 0.5