# Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning


Dear ICML reviewers,    
This codebase provides all the necessary components to reproduce the experimental results reported in our paper. Our implementation is built upon several prior works (see Appendix B.4 in the paper dradt), with slight modifications to facilitate a large-scale grid search pipeline.

## [Step 1] Environment Setup
We provide two options for environment setup. **Option 2** is verified by the authors.

### Option 1: Legacy (Following [PiSSA Guidelines](https://github.com/GraphPKU/PiSSA))
```bash
conda create -n lora-peft-env python=3.10 -y
conda activate lora-peft-env
conda install nvidia/label/cuda-12.1.0::cuda-toolkit
conda install pytorch==2.4.0 torchvision=0.19.0 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements_pissa.txt
pip install flash-attn --no-build-isolation
```


### Option 2: Recommended (Verified by Authors)
```bash
conda create -n lora-peft-env python=3.12 -y
conda activate lora-peft-env

# Install PyTorch (Adjust index-url based on your CUDA version)
pip3 install torch==2.7.1 torchvision==0.22.1 torchaudio==2.7.1 --index-url https://download.pytorch.org/whl/cu126

# Install requirements
pip3 install -r requirements.txt

# Install Flash-Attention (Download the wheel matching your specific cuda version)
wget https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.2/flash_attn-2.8.2+cu12torch2.7cxx11abiFALSE-cp312-cp312-linux_x86_64.whl
pip3 install flash_attn-2.8.2+cu12torch2.7cxx11abiFALSE-cp312-cp312-linux_x86_64.whl

# Install other packages
pip3 install scipy wandb
```

p.s. the wheel for flash-attn can be found at `Dao-AILab/flash-attention/releases`

## [Step 2] Dataset Preparation

We utilize the preprocessed datasets released by the PiSSA authors. Download them to the root directory using the following commands:

```bash
pip install "huggingface-hub>=0.34.0,<1.0"
pip install "fsspec[http]<=2025.10.0"
pip check
huggingface-cli download --repo-type dataset --resume-download fxmeng/pissa-dataset --local-dir pissa-dataset
```

You should see `pissa-dataset` appear in you local directory


## [Step 3] Core Implementation Reference
For reviewers interested in the technical implementation of each PEFT method, please refer to the following files:

| Method | Implementation Path | Source Reference |
|--------|---------------------|------------------|
| LoRA | `train.py` (Standard PEFT) | HuggingFace PEFT |
| DoRA | `train.py` (`use_dora=True`) | HuggingFace PEFT |
| PiSSA | `./utils/init_pissa.py` | `/utils/init_pissa.py` in the official codebase |
| MiLoRA | `./utils/init_milora.py` | `/svd_init.py` in the official codebase |
| InitAB | `./utils/init_AB_util.py` | `/utils/nzlora.py` in the official codebase |


# [Step 4] Run Training & Evaluation

## 1. Basic Files
+ `./scripts/train.sh`
+ `./scripts/test.sh`

## 2. Running Grid Search
```bash
# Reporduce paper Figure 1:
chmod +x ./scripts/qwen/math.sh  
./scripts/qwen/math.sh # please see the script for line-by-line explanation
```


## 3. Check the rsults 
+ Your results (`perf.json`, `model_response.jsonl`) will be saved to `./output`
+ Adjust `output_home` in `./scripts/qwen/math.sh` to change to other directories.
+ To check fine-tuned performance, run:
```bash
cd exp_record

## get all current hyperparameter search results into a json file
python3 get_record.py --task metamath --model 'qwen-3-0.6b' --output '../output' --save_path './json/qwen-3-0.6b/metamath'

## terminal strucutred output
python3 summary.py --record_path './json/qwen-3-0.6b/metamath/metamath_qwen_3_0.6b_rank128_bs16.json'
```
+ For other model-task combinations, check `./scripts/gemma` and `./scripts/llama`


## End
