# Best-Policy Identification (BPI)

Python experiments for best-policy identification on tabular environments (e.g., RiverSwim variants).

## Requirements
To run the examples you need at least Python 3.10 and the following libraries installed: `numpy scipy cvxpy mosek torch matplotlib notebook tqdm seaborn pandas cython`

## Setup
- Dependencies are Python-based (NumPy/TQDM, etc.). If needed, install your environment packages via pip/conda.
- `run.py` attempts to use an optional Cython acceleration (`utils/cutils.pyx`) via `pyximport`; it falls back to pure Python if the build/import fails.

## Run
Run from within this folder:
- `python run.py`

This writes compressed result files under `bpi/data/<env>/` (e.g., `*.pkl.lzma`).

## Plotting
- `make_plots.ipynb` contains plotting/analysis utilities (notebook outputs are cleared for submission hygiene).

