# Code Instructions for HyMTRL

This repository hosts the PyTorch implementation of "**HyMTRL: A Hybrid Multi-Task Reinforcement Learning Framework via Phased Policy Evolution**" on the **MetaWorld benchmark**.

**NOTE**:

The code is based on the [CTPG](https://github.com/DarkDawn233/CTPG) and [MTRL](https://github.com/facebookresearch/mtrl) codebases.

The [MetaWorld benchmark](https://meta-world.github.io/) must be installed separately. Because MetaWorld is under active development, all experiments were performed on the stable release v2.0.0: https://github.com/Farama-Foundation/Metaworld/tree/v2.0.0.



## Setup

1. Set up the working environment: 

```shell
pip install -r requirements.txt
```

2. Set up the MetaWorld benchmark: 

First, install the mujoco-py package by following the [instructions](https://github.com/openai/mujoco-py#install-mujoco).

Then, install MetaWorld:

```shell
pip install git+https://github.com/Farama-Foundation/Metaworld.git@v2.0.0
```
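
As a quick sanity check after installation, the sketch below tries to import the two packages. It assumes that `python` on your `PATH` is the interpreter the packages were installed into; `check_import` is a hypothetical helper written for this example and is not part of the repository.

```shell
#!/usr/bin/env bash
# check_import (hypothetical helper): report whether a Python module can be imported.
# Assumes `python` on PATH is the interpreter used for the installation above.
check_import() {
  if python -c "import $1" 2>/dev/null; then
    echo "ok: $1"
  else
    echo "missing: $1"
  fi
}

# Import names of the mujoco-py and MetaWorld packages.
check_import mujoco_py
check_import metaworld
```

If either line reports `missing`, revisit the corresponding installation step before starting training. Note that the first import of `mujoco_py` may take a while.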



## Training

Use the `scripts/start.sh` script to quickly run the code as follows:

```shell
bash scripts/start.sh $alg $map
```

- `$alg` is one of: `hy_mtsac`, `hy_mhsac`, `hy_pcgrad`, `hy_sm`, `hy_paco`, `hy_moore`
- `$map` is one of: `mt10`, `mt50`

For example, to run `MHSAC w/ HyMTRL` on the `MetaWorld-MT50` setup:

```shell
bash scripts/start.sh hy_mhsac mt50
```

All results will be saved in the `log` folder.
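
To plan a sweep over every algorithm and benchmark listed above, a dry-run loop such as the following may help. This is a minimal sketch: `list_runs` is a hypothetical helper that only prints the commands, and it is not part of the repository.

```shell
#!/usr/bin/env bash
# Algorithm and benchmark names accepted by scripts/start.sh (taken from the lists above).
algs="hy_mtsac hy_mhsac hy_pcgrad hy_sm hy_paco hy_moore"
maps="mt10 mt50"

# list_runs (hypothetical helper): print one start command per algorithm/benchmark pair.
# Nothing is executed here; the function only writes the commands to stdout.
list_runs() {
  for map in $maps; do
    for alg in $algs; do
      echo "bash scripts/start.sh $alg $map"
    done
  done
}

list_runs
```

Once the printed list looks right, the commands can be run one after another from the repository root, for example by copying them into a terminal.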



## See Also

Refer to [CTPG](https://github.com/DarkDawn233/CTPG), [MTRL](https://github.com/facebookresearch/mtrl), [MetaWorld](https://github.com/Farama-Foundation/Metaworld), and [mujoco-py](https://github.com/openai/mujoco-py) for additional instructions.