

### Required Packages

- verl
- transformers
- datasets
- ...

### Dataset Preparation

```python get_data.py```


### run verl GRPO scripts

```./run_training.sh```

### convert fsdp to HF model

```python merge_fsdp_to_hf.py```

### Evaluation (ongoing)

refer to the ''inference/'' folder




