# Codebase Task List

## TODOs

- [ ] Execute 5 runs for the non linear attack: --seed 0, --seed 1, --seed 2, --seed 3, --seed 4
- [ ] Generate new statistics from all 5 runs (SNR, LPIPS, SSIM)
- [ ] Run non linear attack with `--curated` flag (subset of tiny-imagenet with same labels)
      n.b.: this scenario supposes that the curated dataset has only 7,500 images (across 9 CIFAR-10 classes)
- [ ] Handle outliers: inject some outlier noise into some samples. Run both the linear and non-linear attack. Inject synthetic outliers (white/red/grayscale images) into CIFAR-10, run attacks, report per-image SNR separating outliers from normal


## Attack description

Linear Attack (linear_attack.py)

Method: Optimization-based inversion
- Assumption: Attacker knows the mixing graph (which sample paired with which) and the mixing coefficient α
- Approach: Minimize a reconstruction loss via gradient descent:
    Loss = MSE(α*V + (1-α)*V[partner], Y_obs) + λ_tv*TV(V) + λ_l2*||V||²
  - Optimizer: Adam, 200 steps, lr=0.05
- Priors: Total Variation (smoothness) + L2 regularization (bounded intensity)
- Domain: Pixel space (no neural network involved)
- Datasets: Runs on all 4 datasets independently (MNIST, CIFAR-10, CIFAR-100, Tiny-ImageNet)

Non-Linear Attack (non_linear_attack.py)

Method: Learned inversion via U-Net
- Assumption: Attacker has access to a public dataset to train a denoiser
- Approach: Train a U-Net to map mixup images → clean images:
    Loss = L1(UNet(mixup), clean) + λ_tv*TV(output)
  - Training: 30 epochs on Tiny-ImageNet mixups (public data)
- Evaluation: Zero-shot transfer to CIFAR-10 mixups (private data)
- Domain: Pixel space, but uses a learned neural network
- Key insight: Tests whether a denoiser trained on one distribution can generalize to attack another

Key Differences
| Aspect | Linear | Non-Linear |
|--------|--------|------------|
| Knowledge required | Mixing graph + α | Only α (no graph needed) |
| Inversion method | Per-sample optimization | Pretrained neural network |
| Compute at attack time | High (200 steps per sample) | Low (single forward pass) |
| Generalization | N/A (exact solve per sample) | Tests transfer attack |
| Attack scenario | Strongest threat model | More realistic threat model |
| Datasets | All 4 independently | Train: Tiny-ImageNet, Attack: CIFAR-10 |

Why Both?
- Linear attack represents the theoretical worst-case: an adversary with maximum information (known mixing graph). If this fails, the scheme is secure against weaker adversaries.
- Non-linear attack represents a practical scenario: an adversary with access to similar public data who trains a general-purpose "demixing" network. This tests whether the protection transfers across distributions.
Both attacks sweep across τ values to show how reconstruction quality degrades as noise increases.

## Codebase Task List

Task1: Multi-seed nonlinear attack

File: v2/non_linear_attack.py
- Run the script 5 times with --seed 0, --seed 1, --seed 2, --seed 3, --seed 4
- Keep all other args identical across runs
- Collect the per-tau SNR mean±std from each run
- Aggregate into a single table: rows = tau values, columns = mean SNR ± std across 5 seeds

Task2. Same-distribution attacker

File: v2/non_linear_attack.py
1. Add CLI arg --public_dataset (default: "tiny-imagenet", choices: ["tiny-imagenet", "cifar10", "cifar100", "mnist"])
2. In main(), replace the hardcoded Tiny-ImageNet loading with:
   - Load train split of --public_dataset as the attacker's training data
   - Keep CIFAR-10 test split as the private/attack target
3. Add a guard: if --public_dataset == "cifar10", ensure train and test subsets are disjoint (train split for attacker, test split for evaluation — this is already the case via get_dataset returning separate splits)
4. Update estimate_average_distance and estimate_per_pixel_variance_global_mean calls to use the public dataset's stats for the training side
5. Run with --public_dataset cifar10 and compare SNR results against the default Tiny-ImageNet run


E3. Outlier vulnerability test
File: v2/non_linear_attack.py (or v2/linear_attack.py, or a new v2/outlier_attack.py)
1. Create a function inject_outliers(dataset, num_outliers, outlier_type, seed):
   - outlier_type="white": solid white images (all pixels = 1.0 after normalization)
   - outlier_type="random_high": random high-intensity images (pixel values sampled from U(0.8, 1.0))
   - outlier_type="ood": load a few images from a different distribution (e.g., grayscale medical-style, or SVHN)
   - Return a ConcatDataset of original + outliers, plus an index mask identifying which samples are outliers
2. Modify the evaluation loop to:
   - Compute per-image SNR as usual
   - Split reporting into two groups: outlier images vs normal images
   - Report mean±std SNR separately for each group
3. Run both linear and nonlinear attacks on the augmented dataset at tau values [1e0, 1e-2, 1e-4, 1e-6]
4. Output a table: rows = tau, columns = (normal SNR, outlier SNR)

---

E4a. Non-IID federated split
File: v2/federated_augment.py
1. Add function split_dataset_non_iid(dataset, num_parties, classes_per_party, seed):
   - Extract all labels from the dataset
   - For each party, randomly assign classes_per_party class indices (without requiring disjoint — parties can share some classes)
   - Each party gets all samples belonging to its assigned classes, split roughly equally if multiple parties share a class
   - Return List[Subset]
2. Add CLI args:
   - --non_iid (flag, default False)
   - --classes_per_party (int, default 2)
3. In augment_workflow, branch on --non_iid:
   - If True, call split_dataset_non_iid instead of split_dataset_equally
   - Log the class distribution per party
4. Run experiments: 10, 20, 30 parties with --non_iid --classes_per_party 2
5. Compare against the existing IID results

---

E4b. FL noise level ablation
File: v2/federated_augment.py
1. Add CLI arg --radii (float list, e.g., --radii 0.1 1.0 10.0 100.0 500.0)
   - If provided, loop over each radius value and run the full baseline+mixup-union experiment for each
   - If not provided, fall back to the single --radius behavior
2. Port the mf_from_tau and estimate_average_distance functions from v2/linear_attack.py (or factor them into a shared utility) so radii can be derived from tau values directly
3. Collect results into a table: rows = radius (or corresponding tau), columns = (baseline accuracy, mixup-union accuracy)
4. Run for a fixed party count (e.g., 10 parties) across 5-6 radius values

---

E6. Random feature extractor ablation
File: v2/augment.py
1. Add CLI arg --no_pretrain (flag, default False)
2. In build_resnet_feature_extractor, change:
      weights = None if no_pretrain else torchvision.models.ResNet18_Weights.IMAGENET1K_V1
   3. Pass the flag through augment_workflow to the builder
4. Run on CIFAR-10 with --no_pretrain and compare accuracy against the pretrained baseline

---

E5. Visual comparison vs InstaHide
Files: New v2/instahide_baseline.py + v2/linear_attack.py
1. Create a small standalone v2/instahide_baseline.py with:
   - Reuse get_transforms, get_dataset, clamp_imagenet_normalized from existing v2 files
   - Implement InstaHide mixing: k=4, Dirichlet lambdas, 2 private + 2 public images, random sign flip per pixel
   - Output: a batch of InstaHide-mixed images + metadata (partner indices, weights)
2. Modify v2/linear_attack.py to accept pre-built mixed observations (instead of always generating its own mixup), or create a wrapper that:
   - Generates InstaHide mixup images via instahide_baseline.py
   - Feeds them into run_linear_attack with the known mixing graph
   - Saves recovered images
3. Run both your method and InstaHide at the same tau values
4. Generate a side-by-side figure: rows = (Original, YourMethod recovered, InstaHide recovered), columns = tau values