## Debiased model-based Representations for Q-learning (DR.Q)

This is the code for reproducing our reported results in the main text. Paper under review, please do NOT distribute.

The code borrows heavily from the [MR.Q codebase](https://github.com/facebookresearch/MRQ)

### Installing
Experiments were originally run with Python 3.9, but Python 3.10-3.12 is supported.

```
pip install -r requirements.txt
```

### Usage

Benchmark is designated by a prefix (Gym-, Dmc-, Dmc-visual-, HBench-) followed by the original environment name.

Example usage:
```
python main.py --env Gym-HalfCheetah-v4
python main.py --env Dmc-quadruped-walk
python main.py --env Dmc-visual-dog-run
python main.py --env HBench-h1-run-v0
```

Please note that you have to install HumanoidBench first to run DR.Q on HumanoidBench tasks. Please refer to [HumanoidBench codebase](https://github.com/carlosferrazza/humanoid-bench) for instructions on installing HumanoidBench.


### Results

The logged results of DR.Q can be found in `results/drq.csv`
