# SAIM-CL: Subspace Alignment for Incremental Merging in Continual Learning

This project implements the **SAIM (Subspace Alignment for Incremental Merging)** method, supporting continual fine-tuning and model merging for both **ViT (Vision Transformer)** and **LLM (Large Language Model)**.

---

## 1. Environment Setup & Preparation (Conda)

The project is divided into ViT and LLM parts. It is recommended to create separate Conda environments for each.

### ViT Environment Setup

#### 1.1 Install Environment
```bash
cd ViT
conda env create -f environment.yml
conda activate iso-merging
```

#### 1.2 Download Pretrained Models
Please download the fine-tuned checkpoints provided by [Task Singular Vectors](https://drive.google.com/drive/folders/1UEM1Thcz1c7dc1nji1i5uTN53Kf6G3-e?usp=sharing) (the same as those provided by [Tall Masks](https://drive.google.com/drive/folders/15ParSng4d5xSdaWdBFsg1617zPXT8Dae)).

#### 1.3 Download Datasets
Most datasets (such as CIFAR10, MNIST, etc.) will be automatically downloaded via `torchvision` or `huggingface`. For datasets that require manual preparation (such as Cars, DTD, EuroSAT, SUN397), please refer to the instructions in [this Issue](https://github.com/mlfoundations/task_vectors/issues/1).
Alternatively, you can directly download my Baidu Netdisk content (which contains all datasets that need to be downloaded manually): https://pan.baidu.com/s/1645oRPGv9v0L1RdigeMwYw?pwd=f7xd, extraction code: f7xd.

#### 1.4 Configure Paths
Before running, please modify `model_location` and `data_location` in [ViT/config/config.yaml](ViT/config/config.yaml):
- `model_location`: Path to store the downloaded checkpoints.
- `data_location`: Path to store the datasets.

### LLM Environment Setup
```bash
# Create base environment
conda create -n saim-llm python=3.8
conda activate saim-llm

# Install dependencies
cd LLM
pip install -r requirements.txt
# If you need flash-attention
pip install flash-attn --no-build-isolation
```

---

## 2. ViT Usage Guide

The ViT part uses `Hydra` for configuration management.

### Run Continual Fine-tuning & Merging
```bash
cd ViT
python continual_learning.py
```

### Modify Configuration Files
- **Global Config**: [ViT/config/config.yaml](ViT/config/config.yaml)
  - `model`: Specify the model architecture (e.g., `ViT-B-32`).
  - `merge_methods`: Specify the list of merging methods, e.g., `["SAIM", "task_arithmetic"]`.
- **Method-specific Config**: [ViT/config/method/SAIM.yaml](ViT/config/method/SAIM.yaml)
  - `use_sabcd`: Whether to use the SABCD optimizer.
  - `optimize_alpha`: Whether to automatically optimize the merging coefficient.
  - When loading parameter settings, method-specific config is loaded first, then overrides global config. (Many parameters are legacy from previous projects; to avoid crashes, it is recommended not to delete them.)
---

## 3. LLM Usage Guide

The LLM part supports continual learning (fine-tuning + merging) from scratch, as well as continual merging of existing fine-tuned models.

### Continual Learning (Fine-tuning + Merging)
```bash
cd LLM
python continual_learning.py \
    --base-model meta-llama/Llama-3.2-1B-Instruct \
    --merge-method SAIM \
    --optimizer sabcd
```

### Modify Configurations/Arguments
LLM is mainly configured via command-line arguments or by modifying [LLM/config/arguments.py](LLM/config/arguments.py):
- `--optimizer`: Options are `sabcd`, `adam`, `sam`.
- `--merge-method`: Options are `SAIM`, `task_arithmetic`, `ties_merge`, `dare`, etc.
- `--task-names`: Defaults to `C-STANCE,FOMC,MeetingBank,ScienceQA,NumGLUE-cm,NumGLUE-ds,20Minuten`.
- All parameters have default values and can be overridden via command line.
---

## 4. Important Notes

1.  **SABCD Optimizer**: Using `sabcd` during fine-tuning helps maintain subspace consistency, which is beneficial for subsequent SAIM merging.
2.  **Experiment Resumption**: LLM supports resuming experiments from interruption using `--continue-experiment` and `--prev-experiment-dir`.
3.  **Data Paths**: For LLM tasks, the corresponding dataset JSON files already exist in the `datasets/` directory, so no manual download is required.
