# README

## Setup

```bash
conda create -n concise_hint python=3.10
conda activate concise_hint
pip install -r requirements.txt
```

## Run

### 1. Start the LLM server

```bash
CUDA_VISIBLE_DEVICES=0 vllm serve Qwen/Qwen3-4B --enable-reasoning \
  --reasoning-parser deepseek_r1 --enable-prefix-caching --max-model-len 25000 --port 8000
```
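
The server can take a while to load the model, so it may help to wait until it answers before moving on to step 2. The sketch below polls the OpenAI-compatible `/v1/models` endpoint that `vllm serve` exposes. `wait_for_server` is a hypothetical helper that is not part of this repository, and the host (`localhost`), attempt count, and delay are assumptions you may need to adjust.

```bash
# Hypothetical helper (not part of this repo): poll the vLLM server until it
# responds on /v1/models, or give up after a fixed number of attempts.
# Usage: wait_for_server [port] [attempts] [delay_seconds]
wait_for_server() {
  local port="${1:-8000}" attempts="${2:-60}" delay="${3:-5}"
  local i
  for ((i = 1; i <= attempts; i++)); do
    # -s: silent, -f: treat HTTP errors as failures
    if curl -sf "http://localhost:${port}/v1/models" > /dev/null; then
      echo "server on port ${port} is ready"
      return 0
    fi
    sleep "${delay}"
  done
  echo "server on port ${port} did not respond after ${attempts} attempts" >&2
  return 1
}
```

With the defaults this waits up to roughly five minutes. Calling `wait_for_server 8000` in a second terminal returns once the server is reachable, after which the commands in step 2 can be started.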

### 2.1 Run concise hint

```bash
python -u run_main.py --model Qwen/Qwen3-4B --dataset gsm8k --repeat_exp_num 5 --max-tokens 10240 --bs 64 --port 8000 \
  --enable_adap --enable_hint --exp Qwen3-4B-gsm8k-ours
```

### 2.2 Run baseline

```bash
python -u run_main.py --model Qwen/Qwen3-4B --dataset gsm8k --repeat_exp_num 5 --max-tokens 10240 --bs 64 --port 8000 \
  --exp Qwen3-4B-gsm8k-base
```
