# MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

> Submitted to ICML2025

## 1. Download the datasets

You have to download the commonsense reasoning and math reasoning dataset from a public repository, LLM-Adapters. Other datasets in our experiments are not required to download manually.

Training Set: Download the complete commonsense datasets from [here](https://github.com/AGI-Edgerunners/LLM-Adapters/tree/main/dataset) and the commonsense 170k finet-uning dataset from [here](https://github.com/AGI-Edgerunners/LLM-Adapters/blob/main/ft-training_set/commonsense_170k.json), then organize the data as follows. Put them in the path  MISA/datasets.

Testing Set: download all datasets from [here](https://github.com/AGI-Edgerunners/LLM-Adapters/tree/main/dataset), and put them in the path  MISA/datasets.

## 2. Build Environment

```bash
conda create --name MISA
conda activate MISA
pip install -e .
pip install -r requirements.txt
```

## 3.  Run Experiments

We provide configurations in the script path. 

Runing examples:

```bash
sh script/commonsense_MISA.sh
sh script/commonsense_BAdam.sh
```
