
# Distribution Reweighting for Class Unlearning

This repository contains the codebase for our paper on **Empirical Studies of Distribution Reweighting for Class Unlearning**.

We propose several reweighting-based strategies (RW, RW_FT, RW_FT_par) for forgetting a specific class from a trained model while preserving accuracy on the retained classes. This repo includes code for training, unlearning, mask generation (SalUn), and evaluation (ULiRA, MIA).

---

## Repository Structure

- `batch_job_submit.py`: main entry point for launching training, unlearning, and evaluation jobs.
- `apply_unlearn_clean.py`: main script for RW_FT and RW_FT_par methods.
- `apply_ulira.py`: code for U-LiRA evaluation.
- `class_indices/`: contains index files for class-specific forgetting.
- `logs/`: output and checkpoint directory.

---

## How to Run

### 1. Train Original Models and Save Class Indices

Run the following to train baseline models and generate per-class index files:

```bash
python batch_job_submit.py \
    --job_type train \
    --dataset {DATASET} \
    --method orig \
    --model {MODEL} \
    --mode wBN \
    --seed -1
```

- `DATASET`: one of `mnist`, `cifar10`, `cifar100`, or `imagenet`
- `MODEL`: one of `ResNet18`, `VGG`
- If `--seed -1` is passed, runs on seeds `[1, 10, 100]`

---

### 2. Generate SalUn Masks (for SalUn unlearning baseline)

Run the following per model seed (e.g., seed 1, then manually again for seed 10, etc.):

```bash
python batch_job_submit.py \
    --job_type unlearn \
    --method orig \
    --mode wBN \
    --seed 1 \
    --dataset {DATASET} \
    --unlearn_indices /class_unlearn/class_indices/{DATASET}_label_{LABEL}.csv \
    --model_path /class_unlearn/logs/correct/scratch/{DATASET}_unnorm/vanilla_orig_wBN_{SEED} \
    --unlearn_method genmask
```

---

### 3. Run Class Unlearning Methods

Supported unlearning methods:
`[retrain, FT, GA, RL, BS, l1, salun, RW, RW_FT, RW_FT_par]`

To run:

```bash
python batch_job_submit.py \
    --job_type unlearn \
    --method orig \
    --mode wBN \
    --seed -1 \
    --dataset {DATASET} \
    --unlearn_indices /class_unlearn/class_indices/{DATASET}_label_{LABEL}.csv \
    --model_path /class_unlearn/logs/correct/scratch/{DATASET}_unnorm_original/{MODEL}_vanilla_orig_wBN_1 \
    --unlearn_method {UNLEARN_METHOD}
```

Notes:
- For `RW`: use model path  
  `/class_unlearn/logs/correct/scratch/{DATASET}_unnorm_layers/ResNet18_vanilla_orig_wBN_1`
- For `RW_FT`, `RW_FT_par`: ensure `apply_unlearn_clean.py` is used in `job_unlearn_submit.slurm`

---

### 4. Evaluate with U-LiRA

To run U-LiRA evaluation:

```bash
python apply_ulira.py \
    --method orig \
    --mode wBN \
    --seed 1 \
    --dataset {DATASET} \
    --lr {LR} \
    --epoch 15 \
    --unlearn_indices /class_unlearn/class_indices/{DATASET}_label_{LABEL}.csv \
    --source_model_path /class_unlearn/logs/correct/scratch/{DATASET}_unnorm_original/{MODEL}_vanilla_orig_wBN_1 \
    --unlearn_method {UNLEARN_METHOD}
```

For `RW`, use the adjusted model path (as above) when running `apply_ulira.py`.

---

## Notes

- Please ensure all required directories (`logs`, `class_indices`, etc.) exist or are properly configured in your script.
- Hyperparameters, dataset splits, and additional evaluation logic are described in the main paper and `apply_unlearn_clean.py`.
- For SVD results, please refer to the original code in the paper: Deep unlearning: Fast and efficient gradient-free class forgetting.
