## Requirements

Run (same as the other code repo)

```
pip install -r requirements.txt
```

### Usage

To run an evaluation with merging every 2 models every 8 models in glue, simply run:

```sh
python run_in_script.py --models_and_datas=models_and_datas_full_qwen_3b --smaller_batch=1 --output=03107b --tests=8X7whole

```

This command will ask to run with the corresponding configuration in *models_and_datas.py*. To change datasets and file path to stored models, you can change configuration in *models_and_datas.py*. 

*model_name_to_ckpt()* function in *run.py* will automatically find the stored path in *../llama_on_glue/checkpoints*. If stored path changed, make sure to adapt this function for new setting. 

For detailed parameter setting and other evaluation pattern, see *run_in_script.py*.

To calculate different kinds of conflicts between checkpoints, you can check *conflict_script.py*, except Hidden-state distance similarity, which can be calculated through *compare_lora_models.py*.

### Logs

All of the running logs can be seen in *output/*, with metric and model outputs.
