# Offline Imitation Learning with Imperfect Demonstrations (iLID)

## Installation

### Download `mujoco210`
```
wget https://github.com/deepmind/mujoco/releases/download/2.1.0/mujoco210-linux-x86_64.tar.gz
tar -xvf mujoco210-linux-x86_64.tar.gz
mkdir ~/.mujoco
mv mujoco210 ~/.mujoco

# Add environment paths in ~/.bashrc
# export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/lib/nvidia
# export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/root/.mujoco/mujoco210/bin
# export MUJOCO_PY_MUJOCO_PATH=/root/.mujoco/mujoco210
# export MUJOCO_KEY_PATH=$MUJOCO_KEY_PATH:/root/.mujoco
source ~/.bashrc
```

### Install dependencies

```bash
cd iLID && pip install -r requirements.txt
```

## Train imitation policies

Run this command:

```train
./run.sh
```

Datasets are selected through `env_e` and `env_s`. Average returns are saved in `./results` 
(also viewable in [W&B](https://wandb.ai/)).