##  Codebase for ICLR submission "AdaRank: Adaptive Rank Pruning for Enhanced Model Merging"
### Install Dependencies
Install required dependencies(Python 3.8+ recommended) in ```requirements.txt```. 
### Preparing Datasets and Checkpoints
#### Datasets: 
Vision Datasets
-  Refer to Wang et al. https://github.com/nik-dim/tall_masks

Language Model Datasets
- Refer to Yu et al. https://github.com/yule-BUAA/MergeLM
- Refer to Huang et al. https://github.com/harveyhuang18/EMR_Merging
- You should replace ```cache_dir``` in ```lm/utils/load_config.py``` to your directory path that contains dataset.

#### Checkpoints: 
Vision Experiment
- 8 Tasks: Refer to Ilharco et al. https://github.com/mlfoundations/task_vectors
- 14/20 Tasks: Refer to Wang et al. https://github.com/nik-dim/tall_masks
- Place your checkpoints as:
```
your_directory
├── ViT-B-32_TA # 8 task checkpoints 
│   ├── Cars
│   ├── DTD
│   ├── ...
├── ViT-B-32 # 20 task checkpoints 
│    ├── Cars
│    ├── DTD
│    ├── ...
│    ├── FER2013
│    ├── ...
├── ViT-L-14_TA
└── ViT-L-14
```
- Since checkpoints for 8 tasks and 20 tasks are differently finetuned, to use checkpoints in ```ViT-{B or L}-{32 or 14}_TA```, set ```TA_MODE=True``` in configs. Otherwise, all checkpoints from ```ViT-{B or L}-{32 or 14}``` will be used. Default setting is ```TA_MODE=True```.

Language Model
- Roberta: https://huggingface.co/vanillaOVO/roberta_base_glue_ckpts/tree/main
- GPT2: https://huggingface.co/collections/tanganke/gpt-2-models-fine-tuned-on-tasks-from-glue-benchmark-664ab37d9e33e622679f541b
- You should replace ```weight_dir``` in ```lm/utils/load_config.py``` to your directory path that contains checkpoints.
- Place your checkpoints as:
```
your_directory (weight_dir)
├── roberta
│   ├── cola
│   ├── mnli
│   ├── ...
└── gpt2
    ├── gpt2_cola
    ├── gpt2_mnli
    ├── ...
``` 
### Vision Model Merging Experiment
You could find the default config in `/vision/configs/hydra_default.yaml`. Once you prepare the checkpoints and datasets, modify these options:
```python
# Data Path Configuration
data_location: "/path/to/your/dataset/folders/"  
weight_root: "/path/to/your/checkpoints/"  
save: "/path/to/your/checkpoints/" # Used for loading task heads; recommended to set in the same folder as checkpoints.  
openclip_cachedir: "/path/to/your/checkpoints" # Detects pretrained OpenCLIP checkpoints; downloads compatible versions if absent.
```
Note that you should not add ```_TA``` for checkpoint paths. ```TA_MODE``` option in configs will automatically jump into the right directory. 

For vision model merging experiments, we provide codes for merging 8, 14, 20 tasks for 2 backbones (ViT-B/32 and ViT-L/14), respectively.

8-Tasks Benchmark: ```python ./vision/main.py config_list_path="./configs/adarank_{B or L}_T8/adarank_{TA or CART or TSV}.yaml"```

14-Tasks Benchmark: ```python ./vision/main.py config_list_path="./configs/adarank_{B or L}_T14/adarank_{TA or CART or TSV}.yaml"```

20-Tasks Benchmark: ```python ./vision/main.py config_list_path="./configs/adarank_{B or L}_T20/adarank_{TA or CART or TSV}.yaml"```


### Language Model Merging Experiment

For language model experiments, we provide codes for merging 7 tasks for RoBERTa and GPT2.

RoBERTa: ```python ./lm/adarank_roberta_glue.py --exp_config="./config/roberta_{TA or CART or TSV}.yaml"```

GPT2: ```python ./lm/adarank_gpt_glue.py --exp_config="./config/gpt_{TA or CART or TSV}.yaml"```