## 1. Create a Conda Environment
```bash
conda create -n myenv python=3.10
# Activate the conda environment
conda activate myenv
# Install required Python packages
pip install -r requirements.txt
```
## 2. Prepare the Dataset
Dataset Configuration (GSM8K)
Before training, you need to update the GSM8K dataset paths in the training script.
Open the following file:
train_code/train_qwen3_GSM8K_lowrankgradient.py
Locate the dataset loading code:
```bash
dataset = load_dataset(
    "parquet",
    data_files={
        "train": "/path/to/train-00000-of-00001.parquet",
        "test": "/path/to/test-00000-of-00001.parquet"
        }
)
```
Replace the placeholder paths with the actual locations of the GSM8K training and test datasets on your local machine, for example:
```bash
dataset = load_dataset(
    "parquet",
    data_files={
        "train": "data/GSM8K/train-00000-of-00001.parquet",
        "test": "data/GSM8K/test-00000-of-00001.parquet"
        }
)
```
## 3. Training
Open the following file:train_code/train_qwen.sh.Fill in the following information.
```bash
train_script="/path/to/train_qwen3_GSM8K_lowrankgradient.py"
eval_script="/path/to/eval_GSM8K.py"
gsm8k_test_path="/path/to/test-00000-of-00001.parquet"
```
After setting up the environment and configuring the GSM8K dataset paths, start training by running:
```bash
bash train_code/train_qwen.sh
```

## 4. Optional Settings
If you want to use Ours, add the following argument to the command line:
```bash
--use_gradient_compress True
```
If you want to use Ours+, add both of the following arguments:
```bash
--use_gradient_compress True
--use_optimizer_compress True
```