To train reacher_heard by SEGNN:
    1. cd Reacher
    
    2. python train.py actor_type=segnn critic_type=segnn pixel_obs=false action_repeat=1 frame_stack=1 task= reacher_hard agent=ddpg_e3 lr= 5e-5
    3. The The results will be saved at ./exp

The code is adapted from "Continuous MDP Homomorphisms and Homomorphic Policy Gradient" by Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, and Doina Precup, presented at the Advances in Neural Information Processing Systems (NeurIPS) conference in 2022. We gratefully acknowledge their significant contributions to this field.
