# Adapting Noise to Data: Generative Flows from 1D Processes



Install dependencies:

```bash
pip install -r requirements.txt
```

Ensure PyTorch is installed according to your system's CUDA version if GPU support is needed.

## Running Experiments

### 2‑D Targets
```bash
python -m learn_noise.cli_2d --config default
python -m learn_noise.cli_2d --config funnel
```

### Image Datasets
```bash
python -m learn_noise.cli_images --config mnist
python -m learn_noise.cli_images --config cifar10
```

### Common Flags
- `--baseline`: train the flow-matching baseline only.
- `--baseline-flow {linear,mmd,kac}`: select the baseline dynamics. Additional knobs (`--baseline-mmd-b`, `--baseline-kac-a`, `--baseline-kac-c`, …) control each flow.
- `--pretrain`: run quantile pretraining before joint optimisation.
- `--override key=value`: override any YAML config entry.
- `--no-wandb`: disable Weights & Biases logging (disabled in configs by default).

## Baseline Flow Summary

| Flow   | Description                               | Key Parameters                     |
|--------|-------------------------------------------|------------------------------------|
| linear | Linear interpolation between latent & data| `baseline_latent`                  |
| mmd    | MMD-inspired analytic flow                | `baseline_mmd_b`                   |
| kac    | Kac process with light-cone constraint     | `baseline_kac_a`, `baseline_kac_c` |

Default hyper-parameters are defined in the YAML configs under `learn_noise/configs/`.


## Project Structure

```
Learn-Noise/
├── learn_noise/                      # Python package with training and evaluation code
│   ├── cli.py                        # Entry point selecting between task-specific CLIs
│   ├── cli_2d.py                     # CLI for 2-D toy target experiments
│   ├── cli_images.py                 # CLI for image experiments (MNIST, CIFAR-10)
│   ├── configs/                      # YAML configuration files
│   │   ├── 2d/                       # Configs for 2-D targets
│   │   └── images/                   # Configs for image datasets
│   ├── networks/                     # Flow-matching architectures and wrappers
│   │   ├── model_wrapper.py          # Model orchestration utilities
│   │   └── unet_oai.py               # U-Net backbone used for images
│   ├── training/                     # Training loops, optimisers, logging helpers
│   │   ├── train_fm_baseline.py      # Flow-matching baseline trainer
│   │   ├── train_fm_quantile.py      # Quantile-enhanced trainer
│   │   └── quantile_losses.py        # Quantile-specific loss functions
│   ├── utils/                        # Plotting, evaluation, and sampling utilities
│   │   ├── evaluation.py             # Metric computation helpers
│   │   └── sampler.py                # Samplers for flow models
│   └── eval/                         # Additional evaluation scripts
│       └── run_fid.py                # FID evaluation driver
├── data/                             # Local datasets and generated samples (gitignored)
├── requirements.txt                  # Python dependencies
└── README.md                         # Project documentation
```


