# CNCRC Supplementary Package

This directory contains the complete supplementary material for
“Conformal Non-Coverage Risk Control (CNCRC): A Practical Framework for
Risk-Centric Guarantees”. It includes the exact code, processed data,
experimental outputs, and reproduction logs required to replicate every
experiment, table, and figure reported in the paper.

## Directory Structure

- `code/` – self-contained source code
  - `src/` – CNCRC library (calibration, scoring, predictors, visualization)
  - `scripts/` – entry points for each experiment used in the paper
- `data/` – processed datasets and cost annotations used by the scripts
  - `processed/mimic/` – Lightweight deterministic snapshot used by the fixed predictor (preserves behaviour without shipping raw MIMIC data)
  - `processed/drugbank/` – Drug interaction cache
    - `annotations/` & `processed/cost_nc/` – ICD-based non-coverage cost tables
- `results/` – artifacts regenerated on 2025-09-25
  - `risk_aligned_validation/` – CSV & LaTeX tables for the main risk-aligned study
  - `alpha_sweep_optimized/` – JSON backing all paper figures
  - `figures/` – publication-ready PDFs/PNGs (risk–risk trade-off & score distributions)
- `logs/` – console captures of every experiment rerun for verification
- `requirements.txt`, `pyproject.toml`, `experiment_config.json` – environment definition
- `README.md` – this document

## Environment Setup

1. Create a Python 3.11 environment.
2. `pip install -r requirements.txt`
3. All scripts assume execution from the supplementary root:
   ```bash
   cd supplementary
   export PYTHONPATH=$PWD/code
   ```

## Experiments and Expected Outputs

All experiments were rerun on 2025-09-25 using the exact scripts provided
here. The resulting metrics match the values reported in the paper.
Reference logs are stored under `logs/` and the corresponding structured
outputs under `results/`.

### 1. Risk-Aligned Statistical Validation (Main Text Table)
Command:
```bash
python code/scripts/statistical_validation_risk_aligned.py
```
Key replicated metrics (test split, target R_NC = 0.10):
- Standard CP — AmbCost **0.3027**, Coverage 0.740, Actual R_NC 0.1155
- CNCRC-Max — AmbCost **0.2950**, Coverage 0.865, Actual R_NC 0.0830
- CNCRC-Sum — AmbCost **0.2540**, Coverage 0.840, Actual R_NC 0.0970

These numbers coincide with Table 2 of the paper (CNCRC-Sum improving
AmbCost by 16.1% relative to Standard CP at identical risk). Full tables
(including targets 0.08 and 0.12, confidence intervals, and LaTeX export)
are in `results/risk_aligned_validation/`.

### 2. Optimized Alpha Sweep & Paper Figures
Commands:
```bash
python code/scripts/optimized_alpha_sweep.py
python code/src/cncrc/visualization/corrected_paper_plots.py
```
Outputs:
- `results/alpha_sweep_optimized/alpha_sweep_results_optimized.json`
  (per-method trajectories across α)
- `results/figures/risk_risk_tradeoff.{pdf,png}`
- `results/figures/score_distributions_rnc_aligned.{pdf,png}`

Figure checks:
- Risk–risk trade-off curves reproduce the exact trajectories from Fig. 3.
- Score distributions use the same operating points as Fig. 4:
  - CNCRC-Sum α=0.100 → R_NC ≈ 0.078
  - CNCRC-Max α=0.100 → R_NC ≈ 0.087
  - Standard CP α=0.080 → R_NC ≈ 0.088
  - Cost-Aware α=0.080 → R_NC ≈ 0.087

### 3. Poison-Class Risk Alignment (Supplementary Table)
Command:
```bash
python code/scripts/risk_aligned_poison_experiment_alpha.py --target_rnc 0.10
```
Replicated highlights:
- Standard CP — R_NC 0.5544, Poison coverage 0.000
- Cost-Aware — R_NC 0.5478, Poison coverage 0.000
- CNCRC-Max — R_NC 0.0970, Poison coverage 1.000
- CNCRC-Sum — R_NC 0.1056, Poison coverage 1.000
- CRC — R_NC 0.0300, Poison coverage 0.933 (large sets)

These values match the guarantee-focused comparison discussed in
Appendix C, demonstrating CNCRC’s unique ability to keep the toxic class
covered while respecting the risk budget.

### 4. Optimized CNCRC Demonstration (Guarantee Illustration)
Command:
```bash
python code/scripts/optimized_cncrc_demonstration.py
```
Outcome summary:
- CNCRC — R_NC 0.0680 ≤ 0.12 (guarantee satisfied)
- Standard CP — R_NC 0.0560 ≤ 0.12 (satisfied but less risk-aware)
- Heuristic cost-aware — R_NC 0.1640 > 0.12 (guarantee violated)

This reproduces the textual example used in Section 4 and confirms the
paper’s qualitative claims.

## Data Notes

- The package includes a compact, deterministic snapshot under `data/processed/mimic/` containing only the columns needed by the fixed predictor. It reproduces the exact probabilities used in the paper while avoiding large raw MIMIC dumps.
- DrugBank cache artifacts (`data/processed/drugbank/…`) and ICD-derived cost tables remain unchanged because they are already small.
- Raw clinical source files are **not** redistributed; they are unnecessary for regeneration.
- Original MIMIC data can be obtained by any reviewer with PhysioNet credentials (see https://physionet.org/content/mimiciv or the corresponding MIMIC-III landing page). Follow the official application, complete the required data-use agreement, and download the same release referenced in the paper. The preprocessing scripts under `code/src/cncrc/data/` convert those raw tables into the lightweight snapshots we distribute.
- Because the distributed JSONL snapshots encode the exact model inputs/outputs used in every experiment, rerunning any script reproduces the reported numbers deterministically. For end-to-end verification, reviewers may rebuild the snapshots from raw MIMIC using the provided preprocessing code; the resulting files should match the shipped versions up to ordering.
- Each experiment script writes its metrics to `results/` and mirrors the console output captured in `logs/`. Comparing fresh runs against these artifacts offers an additional integrity check for the experimental workflow.

## Verification Checklist

- ✔️ Risk-aligned metrics and confidence intervals match the published table
- ✔️ Figures match the paper (visual inspection + identical operating points)
- ✔️ Poison scenario metrics match Appendix table
- ✔️ Demonstration script reproduces the guarantee narrative
- ✔️ Logs for every rerun are stored under `logs/`

With this package, reviewers can install dependencies, rerun every
experiment, and reproduce all reported numbers and figures without
accessing any external resources.
