# Neural Bayesian Filtering (NBF)

This repository contains the code for the paper **Neural Bayesian Filtering** submitted to ICLR 2026.

The code is organized into three main directories:
    - `src/gridworld/` for the Gridworld experiments,
    - `src/goofspiel/` for the Goofspiel experiments,
    - `src/triangulation/` for the Triangulation experiments.

Each directory contains scripts for training and evaluation of models.

## Setting up the Environment

To replicate our environment, use Python 3.12 and install the Python packages found in `requirements.txt` to a virtual environment by executing the following commands:

```bash
python3 -m venv .venv
source .venv/bin/activate
python3 -m pip install --upgrade pip
python3 -m pip install -r requirements.txt
python3 -m pip install -e .
```

After a successful installation, patch the `sbx` library to support action masking by running:

```bash
./tools/patch_sbx.sh
```

## Running Experiments

Follow these instructions to replicate the results found in our submission. All training and evaluation scripts contain information about command-line arguments within them.

### Gridworld Experiments

The two main files for training models are named `src/gridworld/train_flow.py` (for Approx Beliefs and NBF) and `src/gridworld/train_rnn.py` (for Recurrent). The files are structured the same, here is an example usage demonstrating how to train a flow model on a random 5x5 2D grid, saving the model in directory `gridworld_models/`:

```bash
python3 src/gridworld/train_flow.py --nsteps=1000 --checkpoint_interval=500 --seed=10
```

The file `src/gridworld/evaluate.py` evaluates all Gridworld models. It expects a directory where both flow and recurrent models have been saved by the training scripts. It uses Gridworld parameters defined by the results of model training. This will produce an `f'eval_{args.flow_run_name}_{args.rnn_run_name>}'` subdirectory in `gridworld_models/`.

```bash
python3 src/gridworld/evaluate.py --nparticles_pf 16 32 64 --nparticles_nf 16 32 64
```

### Goofspiel Experiments

Before running experiments, we need to generate a sequence of policies our training scripts expect. This can be done by running the following command:

```bash
python3 src/goofspiel/generate_policies.py --num_cards=5 --num_envs=16 --self_play_iters=1 --train_iters=8 --train_timesteps=1024 --seed=10
```

The two main files for training models are named `src/goofspiel/train_flow.py` (for Approx Beliefs and NBF) and `src/goofspiel/train_rnn.py` (for Recurrent). The files are structured the same, here is an example usage demonstrating how to train a model for a five-card Goofspiel:

```bash
python3 src/goofspiel/train_flow.py --num_cards=5 --num_epochs=32 --batch_size=32 --num_samples=64 --seed=10
```

The file `src/goofspiel/filter.py` evaluates the particle filter and a given pair of flow and recurrent models. It expects the names of saved flow and recurrent models produced by the `train_flow.py` and `train_rnn.py` scripts as input. The following command evaluates given checkpoints in a five-card Goofspiel:

```bash
python3 src/goofspiel/filters.py --num_cards=5 --num_policy_pairs=1 --num_eval_episodes=10 --flow_model_ckpt='model-flow-05-32-10.eqx' --rnn_model_ckpt='model-rnn-05-32-10.eqx' --seed=10
```

### Triangulation Experiments

Triangulation experiments follow a very similar structure to the Gridworld experiments. To train a flow, we run the following command:

```bash
python3 src/triangulation/train_flow.py --nsteps=100 --checkpoint_interval=50 --seed=10
```

Finally, we can evaluate the trained models using:

```bash
python3 src/triangulation/evaluate.py --num_pf_particles 16 32 64 --num_nbf_particles 16 32 64
```

### Donuts Experiments

Training and plotting code for the toy domain introduced in the paper can be found in the `notebooks/` directory. The `donuts_demo.ipynb` notebook contains code for training Normalizing Flow models and `flow_matching_demo.ipynb` follows the same structure but uses Conditional Flow Matching.
