# FlowMAP: Flow Matching for generalizable Agent Planning

A novel approach to agent planning that formulates planning as a continuous-time flow matching to achieve robust generalization under dynamic heterogeneity in observations, dynamics, and objectives.

If you find this code useful, please reference in your paper:

```
@article{yourname2025flowmap,
  title={FlowMAP: Flow Matching for generalizable Agent Planning},
  author={Your Name and Co-authors},
  journal={Journal Name},
  pages={1--10},
  year={2025},
  publisher={Publisher}
}
```

## FlowMAP

FlowMAP addresses the challenge of dynamic heterogeneity in agent planning—nonstationary observations, dynamics, and objectives with sparse, delayed rewards—which dominant methods largely ignore, leading to poor generalization under environment shifts.

FlowMAP formulates planning as a continuous-time flow matching by learning a planning-time velocity field that transports an initial meta-state distribution toward a task-conditioned target. The method introduces Value-Transport Flow Matching to provide distribution-level planning objectives that steer transport toward high-value regions in meta-state distributions, mitigating error accumulation under environmental shifts.

To enforce alignment between meta-state distributions transport and action-environment interaction, FlowMAP further proposes Flow-Policy Co-Training, which jointly optimizes the planning flow and policy so that the flow transport directly regularizes the policy-induced meta-distribution dynamics.

Across diverse agent planning benchmarks, FlowMAP consistently outperforms strong baselines, yielding improvement in planning generalization.

# Instructions

The code has been tested on Linux and Mac and requires Python 3.11+.


## Manual

Install [JAX][jax] and then the other dependencies:

```sh
pip install -U -r requirements.txt
```

Training script:

```sh
python flowmap/main.py \
  --logdir ~/logdir/flowmap/{timestamp} \
  --configs crafter \
  --run.train_ratio 32
```

To reproduce results, train on the desired task using the corresponding config, such as `--configs atari --task atari_pong`.

To resume training from a checkpoint, use the same command with the same logdir:

```sh
python flowmap/main.py --logdir ~/logdir/flowmap/{timestamp} --configs atari --run.train_ratio 32
```

View results:

```sh
pip install -U scope
python -m scope.viewer --basedir ~/logdir --port 8000
```

Scalar metrics are also written as JSONL files.

# Tips

- All config options are listed in `flowmap/configs.yaml` and you can override them as flags from the command line.
- The `debug` config block reduces the network size, batch size, duration between logs, and so on for fast debugging (but does not learn a good model).
- By default, the code tries to run on GPU. You can switch to CPU or TPU using the `--jax.platform cpu` flag.
- You can use multiple config blocks that will override defaults in the order they are specified, for example `--configs crafter size50m`.
- By default, metrics are printed to the terminal, appended to a JSON lines file, and written as Scope summaries. Other outputs like WandB and TensorBoard can be enabled in the training script.
- If you get a `Too many leaves for PyTreeDef` error, it means you're reloading a checkpoint that is not compatible with the current config. This often happens when reusing an old logdir by accident.
- If you are getting CUDA errors, scroll up because the cause is often just an error that happened earlier, such as out of memory or incompatible JAX and CUDA versions. Try `--batch_size 1` to rule out an out of memory error.
- Many environments are included, some of which require installing additional packages. See the `Dockerfile` for reference.
- To continue stopped training runs, simply run the same command line again and make sure that the `--logdir` points to the same directory.

# Disclaimer

This repository contains an implementation of FlowMAP for agent planning. The implementation has been tested to work on a range of environments.

[jax]: https://github.com/google/jax#pip-installation-gpu-cuda