# OFMU: Optimization-Driven Framework for Machine Unlearning

## 🚀 Quick Start

### 1. Environment Setup
```bash
# Create conda environment
conda env create -f environment.yml

conda activate ofmu


# Set up Hugging Face authentication (required for some models)
export HUGGINGFACE_TOKEN="your_hf_token_here"
# or
huggingface-cli login
```

### 2. Data Setup
```bash
# Download all datasets from Hugging Face Hub
python setup_data.py --all

# Or download specific datasets
python setup_data.py --dataset tofu --scenario forget05
python setup_data.py --dataset wmdp --domain bio
python setup_data.py --dataset cifar
```

### 3. Run Experiments
```bash

# Run specific experiment
python main_experiments.py --experiment tofu --model llama2-7 --scenario forget05

```

```

## 🔧 Configuration

### Main Configuration (`config/experiment_config.yaml`)

The main configuration file controls all aspects of the experiments:

```yaml
# Key sections:
experiment:          # General experiment settings
training:           #    hyperparameters  
ofmu:              # OFMU-specific parameters
datasets:          # Dataset configurations
models:            # Model specifications
evaluation:        # Evaluation settings
experiments:       # Experiment matrix
```

### Model Configuration (`config/models.yaml`)

Defines all models used with proper Hugging Face Hub integration:

```yaml
language_models:
  llama2-7:
    model_name: "llama2-7"      
    max_length: 1024
    batch_size: 8
    dtype: "float16"
    trust_remote_code: true
```

### Dataset Configuration (`config/datasets.yaml`)

Specifies all datasets with Hugging Face Hub sources:

```yaml
datasets:
  tofu:
    base_repo: "locuslab/TOFU"
    scenarios:
      forget05:
        repo_id: "locuslab/TOFU"
        filename: "forget05.json"
```

## 🗃️ Datasets

All datasets are automatically downloaded from official sources:

### Language Model Datasets

| Dataset | Source | Description |
|---------|--------|-------------|
| **TOFU** | `locuslab/TOFU` | Task of Fictitious Unlearning |
| **WMDP** | `cais/wmdp` | Weapons of Mass Destruction Proxy |


### Vision Datasets

| Dataset | Source | Description |
|---------|--------|-------------|
| **CIFAR-10** | `torchvision` | 10-class image classification |
| **CIFAR-100** | `torchvision` | 100-class image classification |

## 🤖 Models

All models are loaded from Hugging Face Hub with proper authentication:

### Language Models

| Model | Hub ID | Notes |
|-------|--------|-------|
| **Llama-2-7B** | `meta-llama/Llama-2-7b-chat-hf` | Requires HF auth |
| **Llama-3-8B** | `meta-llama/Meta-Llama-3-8B-Instruct` | Requires HF auth |

### Vision Models

| Model | Source | Notes |
|-------|--------|-------|
| **ResNet-18** | `torchvision` | Modified for CIFAR (32×32) |

## ⚡ Experiments

### TOFU Experiments

Evaluate unlearning on fictional QA pairs:

```bash
# Run specific TOFU experiment
python main_experiments.py --experiment tofu \
    --model llama2-7 \
    --scenario forget05 \
    --method ofmu

# Experiment matrix
Models: [ llama2-7b, llama3-8b]
Scenarios: [forget01, forget05, forget10] 
Methods: [ofmu, gradient_ascent, gradient_diff, npo, rmu]
```

**Metrics**: Forget Quality (FQ), Model Utility (MU), Truth Ratio (TR)

### WMDP Experiments

Evaluate safety-critical knowledge unlearning:

```bash

# Experiment matrix
Models: [ llama2-7b]
Domains: [bio, cyber, chem]
Methods: [ofmu, gradient_ascent]
```

**Metrics**: QA Accuracy, Safety Score, Retain Performance

### CIFAR Experiments

Evaluate vision model unlearning:

```bash
# Run specific CIFAR experiment
python main_experiments.py --experiment cifar \
    --dataset CIFAR10 \
    --forget_classes 1 \
    --method ofmu

# Experiment matrix
Datasets: [CIFAR10, CIFAR100]
Forget Classes: [1, 5, 10]
Methods: [ofmu, finetune, retrain]
```

**Metrics**: Unlearn Accuracy (UA), Retain Accuracy (RA), Test Accuracy (TA), MIA AUC

## 📊 Results Analysis

Results are automatically saved with comprehensive analysis:

### Result Files
- `results/ofmu_results_TIMESTAMP.json` - Raw results
- `logs/experiment.log` - Detailed logs
- `logs/tensorboard/` - TensorBoard logs (if enabled)

```



```yaml
# In config/experiment_config.yaml
training:
  batch_size:
    language_models: 1
  gradient_accumulation_steps: 8

hardware:
  mixed_precision: true
  gradient_checkpointing: true
```
