# Iterative Robust Satisficing (IRS) — ICML 2026 Code

This repository contains the experimental code for **“Iterative Robust Satisficing: Minimizing Performance Degradation Under Distribution Shift”** (ICML 2026). It includes **IRS** and the baselines reported in the paper across: (i) synthetic label shift, (ii) long-tailed label shift, (iii) spurious-correlation group shift, and (iv) tabular regression under natural shifts.

## Contents
- `synthetic_exp.py` — Synthetic label-shift benchmark (GMM). Runs training/eval across shift magnitudes and **saves results**.
- `plotting.py` — Generates plots/tables from saved synthetic results (e.g., accuracy vs. KL shift with mean ± std).
- `cifar10lt.ipynb` — CIFAR-10-LT (IF ∈ {1,20,50,100}) with WRN-28-10. Reports Avg / Tail / Worst-class accuracy.
- `waterbirds.ipynb` — Waterbirds group shift with WRN-28-10. Reports Worst-group / Overall accuracy.
- `tabular.ipynb` — Bike Sharing + Concrete regression under natural shifts. Reports test MSE.

## Setup
- Python 3.12
- Core deps: `torch`, `torchvision`, `numpy`, `scikit-learn`, `matplotlib`
- Vision experiments use AMP (mixed precision).
- GPU: NVIDIA A100 (80GB).

## Datasets
- **CIFAR-10-LT**: CIFAR-10 via `torchvision`; LT splits generated in-notebook (standard exponential rule).
- **Waterbirds**: uses dataset-provided metadata (label/background) to define 4 groups.
- **Tabular**: loaded/preprocessed in `tabular.ipynb` (standard scaling; targets standardized).

## How to Run
**Synthetic:**
- `python synthetic_exp.py` (writes results to disk)
- `python plotting.py` (reads saved results and produces figures)

**Real-world benchmarks (notebooks):**
1. `cifar10lt.ipynb`
2. `waterbirds.ipynb`
3. `tabular.ipynb`

## Methods Implemented
- ERM, SAM
- GroupDRO, χ²-DRO, CVaR-DRO
- IRM (IRMv1), V-REx, MM-REx
- KL-RS
- **IRS (ours)**:
  - CIFAR-10-LT: class-wise IRS (10 classes)
  - Waterbirds: group-wise IRS (4 groups)
  - Tabular: group-wise IRS over 5 target-quantile bins
