# Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs

A parameter-efficient knowledge unlearning framework using Sine based Parameter-Efficient approach (Sine-based Low-Rank Adaptation) for Large Language Models. This implementation targets Feed-Forward Network (FFN) layers with Gradient Difference + Sine based Parameter-Efficient approach for efficient and effective knowledge removal.

## Overview

Sine based Parameter-Efficient approach extends traditional LoRA (Low-Rank Adaptation) by incorporating sine-based transformations in the weight updates, providing more expressive parameter-efficient fine-tuning. This repository implements knowledge unlearning using the TOFU (Task of Fictitious Unlearning) dataset with the following default configuration:

- **Model**: LLaMA2-7B
- **Method**: Gradient Difference Loss + Sine based Parameter-Efficient approach
- **LoRA Rank**: 4
- **Target Layers**: FFN only (Feed-Forward Networks)
- **Dataset Split**: forget10 (10% forget set)

## Setup Environment

### 1. Create Conda Environment

```bash
# Create and activate conda environment
conda create -n sine_unlearning python=3.10
conda activate sine_unlearning

# Install PyTorch with CUDA support
conda install pytorch pytorch-cuda=12.4 -c pytorch -c nvidia
conda install -c "nvidia/label/cuda-12.4.0" cuda-toolkit

# Install required packages
pip install -r requirements.txt
pip install flash-attn --no-build-isolation
```

### 2. Prepare Directory Structure

```bash
cd TOFU
mkdir -p saved_weights final_results
```

## Full Finetune Base Model (Optional)

If you want to fine-tune your own base model instead of using the pre-trained ones from HuggingFace:

```bash
cd TOFU

# Fine-tune LLaMA2-7B on full TOFU dataset
CUDA_VISIBLE_DEVICES=0,1 torchrun \
    --nproc_per_node=2 \
    --master_port=28765 \
    finetune.py \
    --config-name=finetune.yaml \
    split=full \
    batch_size=4 \
    gradient_accumulation_steps=4 \
    model_family=llama2-7b \
    lr=1e-5

# Fine-tune for retain90 split (90% retention data)
CUDA_VISIBLE_DEVICES=0,1 torchrun \
    --nproc_per_node=2 \
    --master_port=28765 \
    finetune.py \
    --config-name=finetune.yaml \
    split=retain90 \
    batch_size=4 \
    gradient_accumulation_steps=4 \
    model_family=llama2-7b \
    lr=1e-5
```

**Note**: Pre-trained models are available on HuggingFace:
- `locuslab/tofu_ft_llama2-7b` (full dataset)
- `locuslab/tofu_ft_llama2-7b_retain90` (retain90 split)

## Unlearn Forget Set from Base Model

### Quick Start (Recommended)

Use the provided script with default settings:

```bash
# Run with default configuration: LLaMA2-7B, rank 4, forget10, grad_diff + Sine based Parameter-Efficient approach
./run_llama2_sine_unlearning.sh
```

### Advanced Usage

Customize parameters as needed:

```bash
# Example with custom parameters
./run_llama2_sine_unlearning.sh \
    --model_family llama2-7b \
    --split forget10 \
    --lora_rank 4 \
    --lr 0.0001 \
    --batch_size 2 \
    --gradient_accumulation_steps 16 \
    --num_epochs 5 \
    --forget_loss grad_diff \
    --sine_freq 100 \
    --sine_scale 45.25

# View all available options
./run_llama2_sine_unlearning.sh --help
```

### Manual Execution

For more control, run directly with torchrun:

```bash
cd TOFU

CUDA_VISIBLE_DEVICES=0 torchrun \
    --nproc_per_node=1 \
    --master_port=28765 \
    forget.py \
    --config-name=forget.yaml \
    model_family=llama2-7b \
    split=forget10 \
    batch_size=2 \
    gradient_accumulation_steps=16 \
    num_epochs=5 \
    lr=0.0001 \
    LoRA.targets=ffn \
    LoRA.r=4 \
    LoRA.alpha=8 \
    LoRA.dropout=0 \
    forget_loss=grad_diff \
    use_sinelora=true \
    sine_freq=100 \
    sine_scale=45.25 \
    learnable_sine=false
```

## Configuration Parameters

| Parameter | Default | Description |
|-----------|---------|-------------|
| `model_family` | llama2-7b | Model family (llama2-7b, llama3.2-1b, etc.) |
| `split` | forget10 | TOFU dataset split (forget01, forget05, forget10) |
| `lora_rank` | 4 | LoRA rank for parameter efficiency |
| `lr` | 0.0001 | Learning rate for unlearning |
| `batch_size` | 2 | Training batch size |
| `gradient_accumulation_steps` | 16 | Gradient accumulation steps |
| `num_epochs` | 5 | Number of training epochs |
| `forget_loss` | grad_diff | Loss function (grad_diff, KL, npo) |
| `sine_freq` | 100 | Sine frequency for Sine based Parameter-Efficient approach |
| `sine_scale` | 45.25 | Sine scaling parameter |

## TOFU Dataset

This implementation uses the **TOFU (Task of Fictitious Unlearning)** dataset for evaluating knowledge unlearning:

- **Dataset**: Automatically downloaded from `locuslab/TOFU` on HuggingFace
- **Content**: Question-answer pairs about 200 fictitious authors
- **Splits**:
  - `full`: Complete dataset for initial fine-tuning
  - `retain90`: 90% retention set
  - `forget10`: 10% forget set for unlearning evaluation
  - `forget05`: 5% forget set
  - `forget01`: 1% forget set

## Key Features

- **Sine based Parameter-Efficient approach**: Novel sine-based low-rank adaptation for more expressive unlearning
- **FFN-only Targeting**: Efficient unlearning focused on feed-forward layers
- **Gradient Difference Loss**: Effective unlearning objective that promotes forgetting
- **Parameter Efficiency**: Only 0.1-1% of model parameters updated during unlearning
- **Multi-GPU Support**: Automatic detection and scaling across available GPUs
- **Flexible Configuration**: Easy parameter tuning via command-line or config files

## File Structure

```
├── run_llama2_sine_unlearning.sh  # Main execution script
├── requirements.txt               # Python dependencies
├── README.md                     # This documentation
└── TOFU/                        # Core implementation
    ├── forget.py                # Unlearning implementation
    ├── finetune.py              # Fine-tuning script
    ├── dataloader.py            # Custom data loading
    ├── sinelora_config.py       # Sine based Parameter-Efficient configuration
    ├── config/                  # Configuration files
    │   ├── forget.yaml         # Unlearning parameters
    │   ├── finetune.yaml       # Fine-tuning parameters
    │   ├── model_config.yaml   # Model configurations
    │   └── ds_config*.json     # DeepSpeed configurations
    └── data/                   # TOFU dataset files
```

## Troubleshooting

### CUDA Out of Memory
- Reduce `batch_size` (try 1)
- Increase `gradient_accumulation_steps` (try 32 or 64)
- Use smaller model variants (llama3.2-1b)

### Model Download Issues
- Ensure internet connectivity and HuggingFace access
- Check HuggingFace authentication if using gated models

### Permission Errors
```bash
chmod +x run_llama2_sine_unlearning.sh
```

### Environment Issues
- Ensure CUDA version compatibility
- Try reinstalling flash-attention: `pip install flash-attn --no-build-isolation --force-reinstall`

## Acknowledgments

This repository builds upon the excellent work from:

1. **Efficient LLM Unlearning** ([csm9493/efficient-llm-unlearning](https://github.com/csm9493/efficient-llm-unlearning))
   - "Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs" (ICLR 2025)
   - Base implementation and methodology for efficient unlearning

2. **TOFU Dataset** ([locuslab/tofu](https://github.com/locuslab/tofu))
   - "TOFU: A Task of Fictitious Unlearning for LLMs"
   - Comprehensive benchmark for evaluating knowledge unlearning
   - Authors: Pratyush Maini, Zhili Feng, Avi Schwarzschild, Zachary C. Lipton, J. Zico Kolter

3. **Sine-Low-Rank** ([yipingji/Sine-Low-Rank](https://github.com/yipingji/Sine-Low-Rank))
   - Sine based Parameter-Efficient implementation and theoretical foundation

## Citation

If you use this implementation, please cite the relevant papers:

```bibtex
@inproceedings{
cha2025towards,
title={Towards Robust and Parameter-Efficient Knowledge Unlearning for {LLM}s},
author={Sungmin Cha and Sungjun Cho and Dasol Hwang and Moontae Lee},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=1ExfUpmIW4}
}

@article{maini2024tofu,
title={TOFU: A Task of Fictitious Unlearning for LLMs},
author={Maini, Pratyush and Feng, Zhili and Schwarzschild, Avi and Lipton, Zachary C and Kolter, J Zico},
journal={arXiv preprint arXiv:2401.06121},
year={2024}
}
```