####  Training Backdoored LLMs via Fine-Tuning

The training scripts are located in `./script`.
To facilitate the reproduction of different attacks, we provided implementation configurations of various attacks in `./configs`. 
```bash
bash script/run_train.sh
```

#### Attack Success Rate (ASR) Evaluation

We adopt a decoding strategy with `top-p = 0.75` and `temperature = 0` to generate different unsafe responses.

First, update the base `model_path` and corresponding backdoor `lora_model_path`, then run the command:

```shell
python backdoor_evaluate.py
```

For GPT-Judge, run the command:

```shell
python gpt4_judge_quality.py
```

