# Exclusive Unlearning: Forgetting All Except What You Need
This is the official implementation of the paper "Exclusive Unlearning: Forgetting All Except What You Need"
Prior research aims to forget specific what to forget items while retaining the rest of a model's knowledge and abilities. 
In contrast, our work focuses on retaining specific what to retain abilities and forgetting everything else. 

## Requirements
If you use `uv`, you can set up the environment as follows. Please run the experiments inside `exclusive_unlearning` directory:
```
cd exclusive_unlearning
uv venv
source .venv/bin/activate
uv pip install requirements.txt
```

## Model
The following models used in the paper can be executed with this repository:
- OLMo-2-0425-1B
- OLMo-2-1124-7B
- pythia-1.4b
- pythia-6.9b
- gpt2-xl
- gpt-j-6b

## Dataset
The `data` directory contains the training and test datasets for SQuAD, GSM8K, and MathQA used in the experiments.
Additionally, the forgetting training data and test data sampled from OLMo-2-0425-1B is provided in the `data/forget_samples` directory.
Due to the 100MB limit on Supplementary Material in OpenReview, we cannot include datasets for all models at submission time. 
Therefore, we only provide the dataset for the Olmo2-1B case in this submission. This restriction is purely due to the OpenReview submission policy. In the camera-ready version, we will release datasets corresponding to all models.

## Exclusive Unlearning
### Main Experiments
Our goal is **retaining specific "what to retain" abilities and forgetting everything else**.  
Our approach is remarkably simple:  we train the model on the target task via standard fine-tuning while simultaneously forcing the probability distribution over the model's generated texts to become uniform. 
Below are example commands for retaining only the abilities of Extractive QA (SQuAD), Mathematical QA (GSM8K), or Mathematical QA (MathQA).  
Replace `xxxx` with paths and values appropriate for your environment.

- **Retain only the ability of Extractive QA (SQuAD)**
```
python src/exclusive_unlearning/run_exclusive_unlearning_squad.py \
  --model_name_or_path allenai/OLMo-2-0425-1B \
  --output_dir out/unlearn_everything/squad/exclusive_unlearning \
  --cache_dir xxxx \
  --per_device_train_batch_size 4 \
  --train_generate_batch_size 16 \
  --learning_rate 1e-5 \
  --num_train_steps 2000 \
  --logging_steps 50 \
  --report_to wandb \
  --wandb_project xxxx \
  --forget_sample_file data/forget_samples/OLMo-2-0425-1B/sampled_texts.jsonl \
  --run_name xxxx \
  --eval_text_file data/forget_samples/OLMo-2-0425-1B/sampled_texts_for_evaluate_loss.jsonl \
  --retain_weight 1.0 \
  --seq_len 32 \
  --seed 42 \
  --retain_train_data_file data/squad1.1/OLMo-2-0425-1B/train-v1.1.jsonl \
  --retain_eval_data_file data/squad1.1/OLMo-2-0425-1B/validation-v1.1.jsonl
```

- **Retain only the ability of Mathematical QA (GSM8K)**
```
python src/exclusive_unlearning/run_exclusive_unlearning_gsm8k.py \
  --model_name_or_path allenai/OLMo-2-0425-1B \
  --output_dir out/unlearn_everything/gsm8k/exclusive_unlearning \
  --cache_dir xxxx \
  --per_device_train_batch_size 4 \
  --train_generate_batch_size 16 \
  --learning_rate 1e-5 \
  --num_train_steps 2000 \
  --logging_steps 50 \
  --report_to wandb \
  --wandb_project xxxx \
  --forget_sample_file data/forget_samples/OLMo-2-0425-1B/sampled_texts.jsonl \
  --run_name xxxx \
  --eval_text_file data/forget_samples/OLMo-2-0425-1B/sampled_texts_for_evaluate_loss.jsonl \
  --retain_weight 1.0 \
  --seq_len 32 \
  --seed 42 \
  --retain_train_data_file data/gms8k/OLMo-2-0425-1B/train.jsonl \
  --retain_eval_data_file data/gms8k/OLMo-2-0425-1B/test.jsonl
  ```

- **Retain only the ability of Mathematical QA (MathQA)**
```
python src/exclusive_unlearning/run_exclusive_unlearning_mathqa.py \
--model_name_or_path allenai/OLMo-2-0425-1B \
--output_dir out/unlearn_everything/mathqa/exclusive_unlearning \
--cache_dir xxxx \
--per_device_train_batch_size 4 \
--train_generate_batch_size 16 \
--learning_rate 1e-5 \
--num_train_steps 2000 \
--logging_steps 50 \
--report_to wandb \
--wandb_project xxxx \
--forget_sample_file data/forget_samples/OLMo-2-0425-1B/sampled_texts.jsonl \
--run_name xxxx \
--eval_text_file data/forget_samples/OLMo-2-0425-1B/sampled_texts_for_evaluate_loss.jsonl \
--retain_weight 1.0 \
--seq_len 32 \
--seed 42 \
--retain_train_data_file data/MathQA/train.json \
--retain_eval_data_file data/MathQA/test.json
```

### Comparisons with Other Settings
- **Retain only FT:** Our method consists of a forgetting term (that drives the model’s outputs toward a uniform distribution) and a retaining term (that fine-tunes on a specific dataset).  
As a baseline, we compare with fine-tuning only on the retain dataset. Example (SQuAD):
```
python src/retain_only_ft/run_retain_only_ft_squad.py \
--model_name_or_path allenai/OLMo-2-0425-1B \
--output_dir out/unlearn_everything/squad/retain_only_ft \
--cache_dir xxxx \
--per_device_train_batch_size 4 \
--learning_rate 1e-5 \
--num_train_steps 2000 \
--logging_steps 50 \
--report_to wandb \
--wandb_project xxxx \
--run_name xxxx \
--seed 42 \
--retain_train_data_file data/squad1.1/OLMo-2-0425-1B/train-v1.1.jsonl \
--retain_eval_data_file data/squad1.1/OLMo-2-0425-1B/validation-v1.1.jsonl
```

- **Gradient Ascent (negative CE loss):** Exclusive Unlearning is fundamentally different from traditional unlearning methods, which aim to forget specific knowledge while retaining all other abilities.  
Thus, we compare instead with Gradient Ascent, the simplest forgetting approach. Example (SQuAD):
```
python src/gradient_ascent/run_gradient_ascent_squad.py \
--model_name_or_path allenai/OLMo-2-0425-1B  \
--output_dir out/unlearn_everything/squad/gradient_ascent \
--cache_dir xxxx \
--per_device_train_batch_size 4 \
--train_generate_batch_size 16 \
--learning_rate 1e-5 \
--num_train_steps 2000 \
--logging_steps 50 \
--report_to wandb \
--wandb_project xxxx \
--forget_sample_file data/forget_samples/OLMo-2-0425-1B/sampled_texts.jsonl \
--run_name xxxx \
--eval_text_file data/forget_samples/OLMo-2-0425-1B/sampled_texts_for_evaluate_loss.jsonl \
--retain_weight 1.0 \
--seq_len 32 \
--seed 42 \
--retain_train_data_file data/squad1.1/OLMo-2-0425-1B/train-v1.1.jsonl \
--retain_eval_data_file data/squad1.1/OLMo-2-0425-1B/validation-v1.1.jsonl
```
### Forgetting Data Sampling
As mentioned above, the forgetting training and test data sampled from the six models are provided in `data/forget_samples`.  
For example, forgetting data is sampled as `batch_size × steps` sentences generated from the model itself:
```
python /data/experiments/0196_knowledge_reduction/exclusive_unlearning/src/forget_data_sampling/generate_sampling.py \
  --model_name_or_path allenai/OLMo-2-0425-1B \
  --output_file data/forget_samples/OLMo-2-0425-1B/sampled_texts.jsonl \
  --cache_dir /data/experiments/0196_knowledge_reduction/model \
  --num_train_steps 2000 \
  --per_device_train_batch_size 16 \
  --seq_len 32 \
  --temperature 2.0 \
  --seed 42 \
  --top_k 100 \
  --device cuda
```

The **Discussion section** of the paper analyzes this sampling strategy. Specifically, we compare sampling from a large-scale external corpus (Wikipedia for OLMo-2-1B, the Pile for pythia-1.4b) with sampling from the model itself.  
In our paper, Our results suggest that self-sampling provides a more cost-effective and comprehensive way to enforce forgetting across knowledge and abilities.  
- Random sampling from **Wikipedia** for OLMo-2-1B
```
python src/forget_data_sampling/olmo2_wiki_sampling.py
```
- Random sampling from the **Pile** for pythia-1.4b
```
python src/forget_data_sampling/pythia_pile_sampling.py
```