# State-Adaptive Prompt Optimization (SAPO)

This repository contains the official implementation for our work **"Training Prompt Matters: State-Adaptive Optimization for Robust Fine-Tuning"**.

## 🛠 Setup

### 1. Environment

You can install the required libraries by running the following command:

```bash
pip install -r requirements.txt
```

### 2. API Configuration for Prompt Generation

Before training with SAPO, you need to configure the LLM used for generating paraphrase prompts.

Please modify the `API_KEY` and `URL` in the following file: ```src/prompt_gen/API.py```

## 📂 Data Preparation

The datasets used in our experiments are organized in the following directories:

* **SUPERNI**: Located in `SUPERNI/`
* **TRACE**: Located in `TRACE/`


## 🚀 Training

We provide training scripts for 4 methods: **LoRAInc**, **LoRAInc + SAPO**, **O-LoRA**, and **O-LoRA + SAPO**. The scripts are located in the `scripts/` folder.

Below are the examples for training **Qwen3-8B** on the **NI_SEQ_C1** task sequence.

### 1. LoRAInc

```bash
mkdir -p saves/LoraInc/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/LoraInc/qwen3-8b/ni_seq_c1.sh > saves/LoraInc/qwen3-8b/ni_seq_c1/logs/train.log 2>&1
```

### 2. LoRAInc + SAPO (Ours)

```bash
mkdir -p saves/LoraInc_SAPO/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/LoraInc_SAPO/qwen3-8b/ni_seq_c1.sh > saves/LoraInc_SAPO/qwen3-8b/ni_seq_c1/logs/train.log 2>&1
```

### 3. O-LoRA

```bash
mkdir -p saves/O-LoRA/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/O-LoRA/qwen3-8b/ni_seq_c1.sh > saves/O-LoRA/qwen3-8b/ni_seq_c1/logs/train.log 2>&1
```

### 4. O-LoRA + SAPO (Ours)

```bash
mkdir -p saves/O-LoRA_SAPO/qwen3-8b/ni_seq_c1/logs/ && \
bash scripts/O-LoRA_SAPO/qwen3-8b/ni_seq_c1.sh > saves/O-LoRA_SAPO/qwen3-8b/ni_seq_c1/logs/train.log 2>&1
```

## 📊 Evaluation

We provide the initial zero-shot performance of different models on SUPERNI and TRACE in the `original_performance/` directory.

To evaluate the Continual Learning metrics (AP, BWT, FWT), run `src/run_evaluation.py`.

**Example:** Evaluate LoRAInc on Qwen3-8B (NI_seq_C1):

```bash
python3 src/run_evaluation.py \
    --outputs_dir saves/LoraInc/qwen3-8b/ni_seq_c1/outputs \
    --initial_metric_file original_performance/qwen3-8b.json
```

**Arguments:**

* `--outputs_dir`: The directory containing the training outputs (e.g., subfolders like `1-taskXXX`, `2-taskXXX`).
* `--initial_metric_file`: The JSON file containing the model's initial performance.

## 💐 Acknowledgments

This repository benefits from [O-LoRA](https://github.com/cmnfriend/O-LoRA). We thank the authors for their wonderful work.


```

```
