Experiment 1: Multi-task Gridworld Dataset.
Instructions:
To train, keep the training part of run.sh active, comment out the test part, and run the script.
To test, do the reverse: comment out the training part, keep the test part active, and run the script.
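
As a rough illustration of the toggling described above, run.sh might be laid out along these lines when set up for training. The section markers, the `stage` variable, and the echo placeholders are assumptions made for this sketch; the actual commands in run.sh are not reproduced here.

```shell
#!/bin/sh
# Hypothetical sketch of the run.sh layout, configured for training.
# The echo lines are placeholders for the real commands.
stage=""

# ---- training part (active) ----
stage="train"
echo "placeholder for the training command"

# ---- test part (commented out) ----
# stage="test"
# echo "placeholder for the test command"
```

To switch to testing, the comment markers would be moved from the test part to the training part.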