# Supplementary Material Code (`supplementary_material_icml_8444/`)

This folder contains the code used to reproduce the main supplementary experiments.
It is organized into three self-contained submodules:

- `gridworld/`: tabular MiniGrid-style FourRooms experiments (coverage / cover-time / reward collection / SR-target ablation)
- `visual_gridworld/`: visual gridworld experiments + comparison framework (**FPVR** vs SR/SP baselines)
- `pbe_dqn_test/`: Atari **FPVR-DQN** (pure exploration with replay buffer; directory name kept for legacy reasons)

This repository intentionally ignores generated artifacts (e.g., `results/`, `runs/`, checkpoints) via the local `.gitignore`.

## Environment

- Python: **3.9+** (tested with Python 3.10)
- OS: Windows/Linux/macOS should work (paths are handled carefully in the scripts)

## Quickstart

From this directory:

```bash
cd supplementary_material_icml_8444
```

### 1) `pbe_dqn_test/` (Atari FPVR-DQN)

Install dependencies:

```bash
python -m pip install -r pbe_dqn_test/requirements.txt
```

Run training (example: Breakout):

```bash
python pbe_dqn_test/main.py --env_name ALE/Breakout-v5
```

More details: see `pbe_dqn_test/README.md`.

### 2) `visual_gridworld/` (visual FPVR + comparisons)

Minimal dependencies (in addition to the usual scientific stack):

```bash
python -m pip install minigrid gymnasium
```

Run the default visual FPVR:

```bash
python visual_gridworld/fpvr_run.py
```

Run with a preset / JSON config:

```bash
python visual_gridworld/fpvr_run.py --preset fast_test --visualize false
python visual_gridworld/fpvr_run.py --config visual_gridworld/config_example.json --beta 2.0 --steps 5000
```

Run the multi-method comparison (outputs under `visual_gridworld/results/`):

```bash
cd visual_gridworld
python exploration_comparison.py --methods random sr sp deep_fpvr tabular_fpvr --n_seeds 3 --total_steps 20000 --coverage_reset_interval 2000
cd ..
```

More details: see `visual_gridworld/README_comparison.md`.

### 3) `gridworld/` (tabular FourRooms)

Install the optional environment dependency:

```bash
python -m pip install minigrid
```

Coverage comparison (saves figures under `gridworld/results/` by default):

```bash
cd gridworld
python fourrooms_exploration.py --total_steps 3000 --n_seeds 50 --horizon 500
cd ..
```

Other scripts:

```bash
cd gridworld
python fourrooms_cover_time.py --n_runs 50 --max_steps 30000
python fourrooms_reward_experiment.py --total_steps 200000 --refresh_k 40000 --eval_every 500 --n_seeds 10
python fourrooms_sr_target_test.py --total_steps 3000 --n_seeds 50
cd ..
```

## Notes on outputs

- `gridworld/` defaults to `gridworld/results/`
- `visual_gridworld/` defaults to `visual_gridworld/results/`
- `pbe_dqn_test/` defaults to `pbe_dqn_test/runs/`

These directories are ignored by `supplementary_material_icml_8444/.gitignore` to keep the submission clean.

