#### Hierarchical Policies Should Be Mutually Responsive: Bidirectional-Reachable Hierarchical RL



To visualize the performance of BrHPO, run the command for the environment of interest:

```bash
python render_result.py logs/AntMaze/visualization 6000    # for AntMaze
python render_result.py logs/AntPush/visualization 6000    # for AntPush
python render_result.py logs/Reacher3D/visualization 4800  # for Reacher3D
```



To train BrHPO, run:

```bash
python main.py --env_name AntMaze
```
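
To queue training for several environments, the command above can be wrapped in a loop. The following is a minimal sketch, not part of the repository: it assumes that `main.py` also accepts `AntPush` and `Reacher3D` as `--env_name` values, mirroring the log directory names used for visualization. Only `AntMaze` is confirmed here. The loop prints each command as a dry run rather than launching it.

```bash
# Dry run: print one training command per environment.
# Assumption: AntPush and Reacher3D are valid --env_name values
# (inferred from the log directory names; only AntMaze is confirmed).
for env in AntMaze AntPush Reacher3D; do
  # Remove the leading `echo` to launch training instead of printing.
  echo python main.py --env_name "$env"
done
```

Checking the printed commands before removing `echo` avoids starting a long run with an environment name the script does not recognize.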

