# 🔥 FIRE: Multi-fidelity Regression with Distribution-conditioned In-context Learning


Official implementation of **"FIRE: Multi-fidelity Regression with Distribution-conditioned In-context Learning using Tabular Foundation Models"** (ICML 2026).

## 📋 Overview

FIRE (**F**idelity-aware **I**n-context **RE**gression) is a novel multi-fidelity regression framework that leverages Tabular Foundation Models (TFMs) to achieve zero-shot Bayesian inference under extreme data imbalance. Our key contributions include:

- 🎯 **Distribution-conditioned Residual Learning**: Augment residual model inputs with full predictive distributions (mean, variance, quantiles) from lower fidelity
- 🚀 **Zero-shot In-context Learning**: No gradient-based training required for regression tasks
- 📊 **State-of-the-art Performance**: Consistently outperforms GP-based and deep learning baselines across diverse benchmarks

## 🛠️ Installation

### Environment Setup

```bash
conda create -n FIRE python=3.12
conda activate FIRE
pip install -r requirements.txt
```

### TabPFN Setup (Required for FIRE_TFM)

Install TabPFN v2.5:
```bash
pip install tabpfn
```

### BNN baseline setup

Install MFBNN: Follow the instruction on https://github.com/bessagroup/mfbml to create a new environment and pip install the package.



### Alternative TFMs
Recommendation: Make a conda environment for each TFM. 
#### TabPFN v2 (Optional)
```bash
pip install tabpfn==2.0.6
```

#### Mitra (Optional)
```bash
pip install autogluon.tabular[mitra] 
```

### ⚙️ TabPFN Modifications for Variance/Quantile Support

To enable variance and quantile predictions with TabPFN, you need to modify the TabPFN source code:

1. **Locate the TabPFN installation**:
```bash
python -c "import tabpfn; print(tabpfn.__file__)"
```
Or install it using the GitHub developer mode
```bash
git clone https://github.com/PriorLabs/TabPFN.git --depth 1
```

2. **Edit the regressor file** (typically `tabpfn/regressor.py`):

Find the `predict` method and add support for `output_type` parameter:

```python
# Around line 150-200 in regressor.py
def _logits_to_output(
    *,
    output_type: str,
    logits: torch.Tensor,
    criterion: FullSupportBarDistribution,
    quantiles: list[float],
) -> np.ndarray | list[np.ndarray]:
    # ... existing code ...

    if output_type == "mean":
        return predictions.mean(axis=0)
    elif output_type == "variance":
        return predictions.var(axis=0)
    
    # ... existing code ...
```

Then change the output type settings at line ~100
```python
# ... existing code ...
_OUTPUT_TYPES_BASIC = ("mean", "median", "mode", "variance")
# ... existing code ...
OutputType = Literal["mean", "median", "mode", "quantiles", "full", "main", "variance"]
```


## 📦 Repository Structure

```
├── README.md
├── requirements.txt
├── example.py              # Usage examples with synthetic data
└── src/
    ├── util.py             # Data loading and encoding utilities
    ├── FIRE.py             # 🔥 FIRE (TFM) and FIRE_GP implementations
    ├── AR1.py              # AR(1) baseline
    ├── ResGP.py            # Residual GP baseline
    ├── NARGP.py            # Non-linear Autoregressive GP
    ├── ContinuAR.py        # ContinuAR GP
    ├── MFKG.py             # Multi-fidelity Knowledge Gradient GP (BoTorch)
    ├── MFBNN.py            # Multi-fidelity Bayesian Neural Network
    └── MFRNP.py            # Multi-fidelity Residual Neural Process
```

## 🚀 Quick Start

See `example.py`


## 📊 Baselines Comparison

| Method | Type | Training | Key Features |
|--------|------|----------|--------------|
| **FIRE_TFM** | TFM | Zero-shot | Distribution-conditioned residuals |
| FIRE_GP | GP | Kernel Learning | Distribution-conditioned residuals |
| AR1 | GP | Kernel Learning | Linear correlation |
| ResGP | GP | Kernel Learning | Residual learning |
| NARGP | GP | Kernel Learning | Non-linear autoregression |
| ContinuAR | GP | Kernel Learning | Continuous fidelity ODE |
| MFKG | GP | Kernel Learning | Multi-fidelity Knowledge Gradient |
| MFBNN | DL | Deep Learning | Bayesian Neural Network |
| MFRNP | DL | Deep Learning | Residual Neural Process |


## 📂 Data Availability

🔒 **Note**: We are committed to ensuring reproducibility of all results. Upon acceptance, we will release the full source code, including the implementation of FIRE, data preprocessing scripts, benchmark configurations, and experiment pipelines. The synthetic examples provided in `example.py` demonstrate the API usage and can be used for testing.

