# Guru

This repository is initialized from [verl](https://github.com/volcengine/verl). verl's README is appended below.

## Setup

```bash
conda create -n guru python=3.12
conda activate guru
conda install -c nvidia/label/cuda-12.4.0 cuda-toolkit cuda-nvcc
pip install uv # using uv to install packages is faster than pip
uv pip install torch==2.6.0
uv pip install flash-attn==2.7.3 --no-build-isolation
uv pip install -e ".[gpu,math]" # quoted so shells such as zsh do not glob the brackets
```
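After the steps above, a quick sanity check can confirm that the pinned packages are importable in the `guru` environment. The snippet below is a minimal sketch, not part of the repository: `check_pkg` is a hypothetical helper name, and it assumes `python3` resolves to the interpreter of the active conda environment.

```bash
# Hypothetical helper (not part of this repo): print a package's version,
# or "missing" if the import fails in the current Python environment.
check_pkg() {
    python3 -c "import $1; print(getattr($1, '__version__', 'unknown'))" 2>/dev/null || echo "missing"
}

# Compare the reported versions against the ones pinned in the setup commands.
echo "torch:      $(check_pkg torch)"
echo "flash_attn: $(check_pkg flash_attn)"
```

If either line reports `missing`, rerun the corresponding install command inside the activated environment before submitting a training job.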

## Training

```bash
sbatch scripts/train/m1_qwen_7b_guru15k.sh
```