This repository contains scripts to run the experiments for the work "EvA-RL: Evaluation-aware Reinforcement Learning".

# Overview

The code is organized into two directories:

- `minatar_brax`: Contains scripts to run the experiments for the MinAtar and Brax environments.
- `gridworld`: Contains scripts to run the experiments for the gridworld environment.

## Setting up the environment

Please create a conda environment that uses python 3.10 and install the requirements.txt file.

```
conda create -n pred310 python=3.10 -y
conda activate pred310
pip install -r requirements.txt -y
```

## Important:
For running the experiments, set up your wandb entity in the appropriate places in the code.

This code has been tested on a machine with 80GB GPU VRAM. If you have less VRAM, you can use smaller transformer models, batch sizes and lower number of driver's test states.

The detailed instructions for running the experiments are in the README.md files in the minatar_brax and gridworld directories.
