# Robust Regression Certification - Supplementary Material

This repository contains the code and scripts for reproducing the experiments in:

**"Robust Regression Certification via Randomized Smoothing"**  
ICML 2026

## Quick Start

### Installation

```bash
pip install -e .
```

### Verify Installation

```bash
python -c "from alpha_smoothing_repro.certify.bounded_fn_certifier_with_mean import BoundedCertifierWithMean; print('✓ Installation successful')"
```

### Note on MNIST Data

- The first time you run MNIST experiments, the scripts will automatically download 
  the original MNIST dataset (~10MB) to `./data/`
- When using `--use_rotation_dataset`, the script will then generate rotated versions 
  (random angles 0-360°) on-the-fly using `experiments/mnist_rotation/dataset_generator.py`
- This rotation generation takes ~1-2 minutes for the test set (10,000 images)
- The rotation is deterministic (seed=42) so results are reproducible
- You will see: "Generating 10000 rotated samples..." - this is expected!

## Repository Structure

```
robust_reg_submission/
├── src/alpha_smoothing_repro/      # Core implementation
│   ├── certify/                    # Certifier implementations
│   │   ├── bounded_fn_certifier_variance_mean.py    # (E, C) + M method
│   │   ├── bounded_fn_certifier_with_mean.py        # (E, C, G) + M method
│   │   ├── variance_gradient_certifier.py           # (C, G) unbounded certifier
│   │   ├── alpha_trimming_certifier.py              # α-trimming baseline
│   │   └── base.py                                   # Base certifier class
│   ├── synthetic_functions.py      # Synthetic function definitions
│   ├── functions.py                # Function utilities
│   ├── neutral_evaluator.py        # Neutral evaluation (oracle radius)
│   └── [other modules]
│
├── experiments/mnist_rotation/     # MNIST rotation experiment setup
│   ├── bounded_certifier_convergence_analysis.py  # U-statistic estimators
│   ├── dataset_generator.py       # Dataset generation
│   ├── e2cnn_rotation_model.py    # Model definition
│   └── e2cnn_rotation_model.pth   # Trained model
│
├── scripts/                        # Experiment scripts
│   ├── analysis/                   # Analysis scripts
│   │   ├── test_unbounded_certifiers_synthetic.py    # Validating Certificate Tightness on Synthetic Data
│   │   ├── analyze_pseudo_radius_results.py           # Certificate Validation: Tightness
│   │   └── create_tightness_figures.py                # Certificate Validation: Tightness
│   ├── mnist_rotation_full_certification.py           # MNIST Rotation: Estimation
│   ├── compare_variance_mean_vs_with_gradient.py      # MNIST Rotation: Compare (E,C)+M vs (E,C,G)+M  
│   ├── mnist_alpha_trimming_certification.py          # MNIST Rotation: α-trimming
│   ├── compute_certified_accuracy_best_sigma.py       # Certified Accuracy Analysis
│   ├── compute_certified_mean_distance_best_sigma.py  # Certified Accuracy Analysis
│   ├── compute_combined_certified_metrics_table.py    # Certified Accuracy Analysis
│   ├── compute_mnist_pseudo_true_radius_simple.py     # Certificate Validation: Tightness
│   ├── run_single_point_convergence_analysis.py       # Convergence Analysis of Radius Estimators
│   ├── plot_certified_accuracy_curves.py              # Plotting utilities
│   └── plot_theta_convergence_analysis.py             # Plotting utilities
│
├── pyproject.toml                  # Package configuration
├── README.md                       # This file
└── REPRODUCTION_GUIDE.txt          # Detailed reproduction instructions
```

## Reproducing Paper Experiments

**For detailed, step-by-step instructions with all parameters, see `REPRODUCTION_GUIDE.txt`.**

Below is a quick overview of commands for each experiment:

### Validating Certificate Tightness on Synthetic Data

```bash
python scripts/analysis/test_unbounded_certifiers_synthetic.py \
    --function all_unbounded \
    --sigma 0.1 \
    --eps_y 0.5 \
    --N_samples 5000 \
    --n_test_points 10 \
    --compute_true_radius \
    --output synthetic_validation_results.json
```

**Time:** ~5-10 minutes

### Convergence Analysis of Radius Estimators

```bash
python scripts/run_single_point_convergence_analysis.py \
    --image_idx 0 \
    --sigma 0.5 \
    --eps_y 10.0 \
    --N_values 100 500 1000 5000 10000 \
    --n_trials 10 \
    --confidence 0.95 \
    --device cpu
```

**Time:** ~30-60 minutes (CPU), ~10-20 minutes (GPU)

### MNIST Rotation Certification and Comparison

**Step 1: Estimate variance and gradient (for each σ):**

```bash
python scripts/mnist_rotation_full_certification.py \
    --n_test 100 \
    --sigma 0.75 \
    --N_values 10000 \
    --n_trials 1 \
    --stratified \
    --use_rotation_dataset \
    --skip_bootstrap \
    --output estimation_sigma0.75_n100.json
```

Repeat for σ ∈ {0.06, 0.12, 0.25, 0.50, 0.75}

**Time:** ~2-4 hours per σ (CPU), ~30-60 min per σ (GPU)

**Step 2: Compute certified radii for both methods:**

```bash
python scripts/compare_variance_mean_vs_with_gradient.py \
    --mode precomputed \
    --variance_gradient estimation_sigma0.75_n100.json \
    --eps_y_deg 10.0 \
    --N 10000 \
    --output comparison_sigma0.75_eps10deg.json
```

This computes certified radii for both (E, C) + M and (E, C, G) + M methods.

**Time:** ~1-2 minutes per σ

### Certified Accuracy Analysis

```bash
python scripts/compute_certified_accuracy_best_sigma.py \
    --comparison_dir . \
    --tolerance 10.0 \
    --R_values 0.05 0.10 0.15 0.20 0.25 \
    --output certified_accuracy_table.tex
```

**Time:** ~1-2 minutes (requires certification results from MNIST Rotation)

### Certificate Validation: Tightness and Soundness

**Tightness Analysis (Pseudo-True Radius):**

```bash
python scripts/compute_mnist_pseudo_true_radius_simple.py \
    --variance_gradient estimation_sigma0.50_n100.json \
    --sigma 0.5 \
    --n_points 100 \
    --n_mc 50000 \
    --device cuda
```

**Time:** ~10-20 hours (GPU), ~50-100 hours (CPU)

## Key Implementation Files

- **Core certifiers:** `src/alpha_smoothing_repro/certify/`
  - `bounded_fn_certifier_variance_mean.py`: (E, C) + M method
  - `bounded_fn_certifier_with_mean.py`: (E, C, G) + M method (main contribution)
  - `variance_gradient_certifier.py`: (C, G) unbounded certifier (for synthetic experiments)
  - `alpha_trimming_certifier.py`: α-trimming baseline

- **U-statistic estimators:** `experiments/mnist_rotation/bounded_certifier_convergence_analysis.py`
  - Contains `BoundedCertifierConvergenceValidator` class
  - Methods: `u_statistic_variance_estimator_alpha_half()`, `u_statistic_gradient_norm_estimator_alpha_half()`

- **Synthetic functions:** `src/alpha_smoothing_repro/synthetic_functions.py`
  - Definitions: synthetic_quadratic, synthetic_slice_function, synthetic_sandwich_function
  - Ground truth computation: `compute_true_radius_analytical()`

## Dependencies

Main dependencies (automatically installed with `pip install -e .`):
- PyTorch >= 1.9.0
- NumPy >= 1.21.0
- SciPy >= 1.7.0
- matplotlib >= 3.4.0
- e2cnn >= 0.2.0 (for MNIST rotation model)
- tqdm, Pillow

See `pyproject.toml` for complete list.

## Minimal Quick Test

To quickly verify everything works:

```bash
# Test synthetic validation (1 test point, takes ~30 seconds)
python scripts/analysis/test_unbounded_certifiers_synthetic.py \
    --function unbounded_quadratic \
    --sigma 0.1 \
    --eps_y 0.5 \
    --N_samples 1000 \
    --n_test_points 1

# Test MNIST estimation (10 samples, takes ~5-10 minutes)
python scripts/mnist_rotation_full_certification.py \
    --n_test 10 \
    --sigma 0.5 \
    --N_values 1000 \
    --n_trials 1 \
    --stratified \
    --use_rotation_dataset \
    --skip_bootstrap
```

## Notes

- **Random seeds:** All experiments use fixed seeds (default: 42) for reproducibility
- **Stratified sampling:** MNIST experiments use stratified sampling (10 samples per digit)
- **GPU acceleration:** Add `--device cuda` to compatible scripts for faster computation
- **Confidence level:** Default is 95% (5% failure probability), can be changed via `--confidence`

## Citation

If you use this code, please cite our paper:

```
[Citation will be added after acceptance]
```

## License

[License information]
