# End-to-end Guide: CIFAR-10 & Fashion-MNIST (3 Scenarios) → Train/Log → Collect → Notebook Figures

This README ties together the **six training scripts** and the **`compute_correlation.ipynb`** notebook.

- Datasets: **CIFAR-10** and **Fashion-MNIST**
- Scenarios: **non_overfitting**, **mild_overfitting**, **overfitting**
- Models: pick **one** per run by *commenting/uncommenting* the model class in each script

The full workflow is:

1. **Pick a dataset+scenario script** and **choose your model** in that script (comment/uncomment).
2. **Run** the script — it trains, logs metrics, and writes bound CSVs.
3. **Collect/move** the outputs for the **(dataset, scenario, model)** into a directory that the notebook expects.
4. **Open the notebook** and set the **CAPITALIZED parameters** to produce the figures.

---

## 0) Files in this repo

### Scripts (2 datasets × 3 scenarios)
- `cifar10_non_overfitting.py`
- `cifar10_mild_overfitting.py`
- `cifar10_overfitting.py`
- `fmnist_non_overfitting.py`
- `fmnist_mild_overfitting.py`
- `fmnist_overfitting.py`

### Notebook
- `compute_correlation.ipynb`

---

## 1) Choose a model *inside* each script (comment/uncomment)

Each script contains **multiple `SmallCNN` definitions** (one per backbone). You should **keep only *one*** active and **comment out** the others.

Look for the section like:
```python
# ---------------------- model ----------------------
# ===== RESNET 18 =====
class SmallCNN(nn.Module):
    def __init__(self, num_classes=NUM_CLASSES):
        super().__init__()
        self.net = models.resnet18(weights=None, num_classes=num_classes)
    def forward(self,x): return self.net(x)

# ===== DENSENET 121 =====
class SmallCNN(nn.Module):
    def __init__(self, num_classes=NUM_CLASSES):
        super().__init__()
        self.net = models.densenet121(weights=None)
        ...
```

- **CIFAR-10**: you will see **ResNet-18** *and* **DenseNet-121** versions — **comment out** one entire class so only the desired one remains.
- **Fashion-MNIST**: both **ResNet-18** and **DenseNet-121** variants are present. The code already adapts convolution layers for **1-channel** input; you do *not* need to modify that.

> ⚠️ **Important:** Python uses the *last* `class SmallCNN` definition it sees. To avoid surprises, **comment out** the versions you **don’t** want (or rename them, e.g. `SmallCNN_OFF`).

---

## 2) Run the training/logging scripts

Each script is self-contained; just run it:

```bash
# pick exactly one to run per experiment
python cifar10_non_overfitting.py
python cifar10_mild_overfitting.py
python cifar10_overfitting.py

python fmnist_non_overfitting.py
python fmnist_mild_overfitting.py
python fmnist_overfitting.py
```

**What gets written (current working directory):**
- `metrics.csv` — per-epoch metrics with columns:
  - `epoch, train_loss, train_acc, val_loss, val_acc, test_loss, test_acc, lr`
- **Bound CSVs** (two losses × two splits):
  - `bound_train_01.csv`, `bound_val_01.csv`
  - `bound_train_ce.csv`, `bound_val_ce.csv`
  - and their **OLD** counterparts:
    - `bound_train_01_old.csv`, `bound_val_01_old.csv`
    - `bound_train_ce_old.csv`, `bound_val_ce_old.csv`
- **Checkpoints** (names depend on dataset & scenario):
  - CIFAR-10:
    - `./cifar10_best.pt` / `./cifar10_last.pt` (non_overfitting)
    - `./cifar10_best_overfit.pt` / `./cifar10_last_overfit.pt` (overfitting/mild)
  - Fashion-MNIST:
    - `./fashionmnist_best.pt` / `./fashionmnist_last.pt` (non_overfitting)
    - `./fashionmnist_best_overfit.pt` / `./fashionmnist_last_overfit.pt` (overfitting/mild)
- **Plots**:
  - `loss_plot*.png`, `acc_plot*.png`

> The scripts may also emit cluster JSON names like `cifar10_{which}_grouping_K_{K}_seed_{seed}.json` or `fashionmnist_{which}_grouping_K_{K}_seed_{seed}.json` if you enable/retain those parts. They’re not required by the notebook below.

---

## 3) Collect outputs into the structure the notebook expects

The notebook **builds a base path** using `DATASET_NAME`, `MODEL_NAME`, and the scenario flag you choose, like:

```
{DATASET_NAME}--{MODEL_NAME}/{scenario}
```

Where `{scenario}` is one of:
- `non_overfitting`
- `mild_overfitting`
- `overfit`  ← **note the spelling** (no “-ting”)

**Minimum files required by the notebook:**
- `metrics.csv`
- `bound_val_01.csv`

> You can also put the plots there if you want to keep everything together.

### Example (Bash) — CIFAR-10 + ResNet-18 + non_overfitting

From the directory where you ran `cifar10_non_overfitting.py`:

```bash
DATASET=cifar10
MODEL=resnet18
SCEN=non_overfitting

DEST="${DATASET}--${MODEL}/${SCEN}"
mkdir -p "$DEST"

mv metrics.csv "$DEST"/
mv bound_val_01.csv "$DEST"/

# optional (if they were generated)
mv loss_plot*.png acc_plot*.png "$DEST"/ 2>/dev/null || true
mv *_best*.pt *_last*.pt "$DEST"/ 2>/dev/null || true
```

### Example — Fashion-MNIST + DenseNet-121 + overfitting

```bash
DATASET=fashionmnist
MODEL=densenet121
SCEN=overfit        # <- spelling must match the notebook

DEST="${DATASET}--${MODEL}/${SCEN}"
mkdir -p "$DEST"
mv metrics.csv bound_val_01.csv "$DEST"/
```

Repeat this step for each `(dataset, scenario, model)` combo you want to compare.

---

## 4) Run the notebook and set the CAPITALIZED params

Open **`compute_correlation.ipynb`** and set these at the top:

```python
NON_OVERFITTING = True
MILD_OVERFITTING = False
OVERFITTING = False         # make exactly one of the three True

DATASET_NAME = "cifar10"    # or "fashionmnist"
MODEL_NAME   = "resnet18"   # e.g., "resnet18", "densenet121", "vgg19"
```

The notebook then computes/plots correlations and saves scenario-specific **PDF** figures, e.g.:

- `{DATASET_NAME}_{MODEL_NAME}_valerr_acc_vs_epoch_non_overfitting.pdf`
- `{DATASET_NAME}_{MODEL_NAME}_trainerr_acc_vs_epoch_non_overfitting.pdf`
- `{DATASET_NAME}_{MODEL_NAME}_gnorm_acc_vs_epoch_non_overfitting.pdf`

(Analogous filenames for `mild_overfitting` and `overfit`.)

**Where does it read from?**

It resolves:
```python
if NON_OVERFITTING:
    BASE = Path(f"{DATASET_NAME}--{MODEL_NAME}/non_overfitting")
elif MILD_OVERFITTING:
    BASE = Path(f"{DATASET_NAME}--{MODEL_NAME}/mild_overfitting")
elif OVERFITTING:
    BASE = Path(f"{DATASET_NAME}--{MODEL_NAME}/overfit")

METRICS_CSV = BASE / "metrics.csv"
BOUNDS_CSV  = BASE / "bound_val_01.csv"
```

So make sure those two files exist in that **BASE** directory.

---

## 5) Tips & gotchas

- **Model choice**: only **one** `SmallCNN` class should remain active. Comment out the rest to avoid accidental overrides.
- **Scenario naming**: the notebook expects **`overfit`**, not `overfitting`, in the path.
- **Multiple runs**: If you run several experiments in the same working directory, **move** the outputs to their target folder **before** you start the next run to avoid overwriting `metrics.csv` & `bound_val_01.csv`.
- **Hardware**: the scripts auto-detect CUDA/MPS/CPU. Mixed precision (`USE_AMP`) is enabled in some scripts.
- **Reproducibility**: each script sets seeds; subset sizes and regularization are tuned per scenario to control over/underfitting.

---

## 6) checklist

1. Open the **right script** for your dataset & scenario.
2. **Pick your backbone** by commenting/uncommenting the `SmallCNN` implementation.
3. `python <script>.py` — wait for it to finish.
4. `mkdir -p {DATASET}--{MODEL}/{scenario}`; **move** `metrics.csv` and `bound_val_01.csv` there.
5. Open the notebook, set `NON_OVERFITTING/MILD_OVERFITTING/OVERFITTING`, `DATASET_NAME`, `MODEL_NAME`, and **Run All**.
6. Grab the PDFs the notebook writes.
