# MINTO OnlineRL

## Installation

CPU installation:
```bash
conda create -n minto python=3.10
condda activate minto
pip install --upgrade pip setuptools wheel
pip install -e .[dev]
```

GPU installation:
```bash
conda create -n minto python=3.10
condda activate minto
pip install --upgrade pip setuptools wheel
pip install -e .[dev,gpu]
```

## Running Experiments
To run an experiment, exectute one of the bash scripts in the root of the repository. For example, you can train MINTO on Breakout by running:
```bash
bash run_dqn.sh min Breakout
``` 
or training DQN:
```bash
bash run_dqn.sh default Breakout
``` 
These will run MINTO and DQN using the CNN architecture. In case of IMPALA, you should change the architecture to impala and add layer norm flag as follows:

```bash
--architecture_type impala --layer_norm
```