<div align="center">

# RESCUE

### <u>RE</u>ducing <u>S</u>ampling cost with <u>C</u>ausal <u>U</u>nderstanding and <u>E</u>stimation

[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)

</div>

---

## Overview

**RESCUE** is a multi-objective multi-fidelity Bayesian optimization method that uses **causal reasoning** to achieve superior sample efficiency. Unlike traditional methods that capture only associational dependencies, RESCUE integrates **causal discovery and inference** to learn causal relationships between inputs, fidelities, and objectives. This enables robust optimization even when lower-fidelity proxies poorly approximate the target fidelity.

Built on [BoTorch](https://botorch.org/) and [GPyTorch](https://gpytorch.ai/), RESCUE seamlessly integrates with the BoTorch and GPyTorch ecosystems, allowing you to leverage existing BoTorch models, acquisition functions, optimization utilities, and GPyTorch's scalable Gaussian Process implementations. 

### Key Features
- **Multi-objective multi-fidelity optimization** with Pareto frontier discovery
- **Causal Gaussian Process probabilistic models** built using causal discovery (via [causal-learn](https://github.com/py-why/causal-learn)) and causal inference (via [DoWhy](https://github.com/py-why/dowhy)) for intervention modeling
- **Causal acquisition strategy** for intelligent cost-information trade-offs
- **BoTorch and GPyTorch integration** for seamless compatibility

### Demo

<video src="assets/rescue_demo.mp4" autoplay loop muted playsinline></video>

---

## Installation

### Automatic installation

* Linux and macOS: On `terminal` run the following command
    - Install `curl` if missing: `sudo apt install curl`
```bash
chmod +x install.sh && \
./install.sh && \
source .venv/bin/activate
```

* Windows: On `cmd` console run the following command
```console
install.bat && ^
.venv\Scripts\activate
``` 

### Manual installation
1. Install [uv](https://docs.astral.sh/uv/getting-started/installation/)

2. Install [Graphviz](https://graphviz.org/download/)

3. Sync using uv. Done!
```bash
uv sync --extra notebooks && \
source .venv/bin/activate
```

**Note:** For Windows activate using `.venv\Scripts\activate`

---

## Getting Started

Although RESCUE is modular, the easiest way to run is to use the RESCUE algorithm as follows:

1. Define the problem (using the XGBoost hyperparameter optimization problem)
```python
import torch
from rescue.problems.hpo.multi_objective import HPOXGBoost

problem = HPOXGBoost(negate=True).to("cuda" if torch.cuda.is_available() else "cpu")
```

2. Define the cost function
```python
from torch import Tensor

def cost_fn(X: Tensor) -> Tensor:
    """ cost(fidelity) = exp(4.8 * fidelity)"""
    return torch.exp(torch.tensor(4.8) * X[..., -1:])
```

3. Load the observational data to learn the Causal Performance Model
```python
from data import load_observational_data
# pandas dataframe
obs_data = load_observational_data(sample_seed=1, problem=problem)
```

4. Run the RESCUE algorithm
```python
from rescue.algorithms import RescueAlgorithmMultifidelity

rescue = RescueAlgorithmMultifidelity(
    problem=problem,
    design_variables=problem.design_var_names,
    objective_variables=problem.objective_var_names,
    fidelity_param_name=problem.fidelity_param_name,
    bounds=problem.bounds,
    is_discrete_fidelities=False,
    # RESCUE assumes fidelity is in the 
    # last dimension of the design space
    target_fidelities={len(problem.design_var_names) - 1: 1.0},
    cost_fn=cost_fn,
)
res = rescue.run(
    init_budget=1.05,
    budget=2.05,
    include_initcost_to_budget=False,
    ref_point=problem.ref_point,
    objective_indices=[0, 1],
    causal_observational_data=obs_data,
    causal_discovery="DirectLiNGAM",
    causal_net_epochs=200,
    causal_intervention_samples=500,    
    compute_metrics=False,
)
print(res['pareto_X'])
```

---

## 🔬 Reproducing Experimental Results

We used [Weights & Biases (W&B)](https://wandb.ai/site/) to log the optimization results. Therefore, an account is required.

All scripts to run the experiments are provided in the `scripts` folder. For example, to run the `XGBoost HPO` problem using different methods:

```bash
# RESCUE method
bash scripts/hpoxgboost/run_rescue_hpoxgboost.sh

# HVKG baseline
bash scripts/hpoxgboost/run_hvkg_hpoxgboost.sh

# MOMF baseline
bash scripts/hpoxgboost/run_momf_hpoxgboost.sh

# GP-qEHVI baseline
bash scripts/hpoxgboost/run_gp_qehvi_hpoxgboost.sh
```

**Note:**
- We ran the experiments on a single machine with `4 x RTX A6000` GPUs and predefined the device number (e.g., `cuda:0`).
- Depending on the system configuration, one would need to change the `--device` parameter value inside the script if multiple GPUs are available.
- If the user does not have GPU(s) it would need to set to `--device cpu`.

### Checking the Results

Once an experiment is completed, running the following command will show the results
```bash
bash scripts/hpoxgboost/results_hpoxgboost.sh 
```

**Note:**
- It is important to run all methods for a particular problem before running this script to get the experimental results.
- This will also ask for your W&B `entity` name, which can be found in your W&B user dashboard.

Optionally, to produce the plots reported in the paper, run the following
```python
python -m experiments.results.paper_plots
```
The figures will be saved in the [experiments/results/figs](experiments/results/figs) directory.

**Note:**
- We already provided the W&B run histories in [experiments/results/data/runs_data](experiments/results/data/runs_data) 
- The results data for each metric reported in the paper, for example [experiments/results/data/HPOXGBoost_best_nsga2_regret.csv](experiments/results/data/HPOXGBoost_best_nsga2_regret.csv).

One can also directly visualize the results on their W&B dashboard. However, these raw plots may differ from the post-processed plots generated using the above.

To maintain anonymity, we did not provide our public W&B dashboard which contains the artifacts **at each iteration** including but not limited to:
- the learned (C)GP models, 
- causal performance models, 
- candidates selected by the acquisition function,
- initial training samples
