# ⚒️ Installation

Environment: PyTorch 2.6.0 (CUDA 12.4) with vLLM 0.8.5.

```
cd PSFT

conda create -n psft python==3.10
conda activate psft
cd verl
pip install --no-deps -e .
```
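To confirm that the active environment matches the versions listed above, a small check like the following may help. This is only a sketch and not part of the repository: `check_versions` and `EXPECTED` are hypothetical names, and the package names `torch` and `vllm` are assumed to be the installed distribution names.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions taken from the environment line above; the dict itself is illustrative.
EXPECTED = {"torch": "2.6.0", "vllm": "0.8.5"}


def check_versions(expected=EXPECTED, get_version=version):
    """Map each package to (installed_version_or_None, matches_expected)."""
    report = {}
    for pkg, want in expected.items():
        try:
            have = get_version(pkg)
        except PackageNotFoundError:
            have = None  # package is not installed in this environment
        # Wheels may carry a local suffix such as "+cu124"; compare the public part only.
        ok = have is not None and have.split("+")[0] == want
        report[pkg] = (have, ok)
    return report
```

Calling `check_versions()` inside the `psft` environment returns one entry per package; any entry whose second element is `False` points to a missing or mismatched install.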

# 🚀 Quick Start

## Prepare Training Data

`python ./prepare_data.py`

You can modify this script to support your own PSFT training dataset. Make sure the `demonstration` key is kept in the training parquet; the test parquet does not need it.
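The requirement above can be checked before training with a helper along these lines. It is a minimal sketch under stated assumptions: `missing_keys` and `REQUIRED_TRAIN_KEYS` are hypothetical names, not part of `prepare_data.py`, and only the `demonstration` key comes from the text above.

```python
# Hypothetical helper; not part of the PSFT repository.
REQUIRED_TRAIN_KEYS = {"demonstration"}  # key name taken from the note above


def missing_keys(columns, split):
    """Return the required keys that are absent from `columns` for a split."""
    if split not in ("train", "test"):
        raise ValueError(f"unknown split: {split!r}")
    # Only the training parquet needs the demonstration key.
    required = REQUIRED_TRAIN_KEYS if split == "train" else set()
    return sorted(required - set(columns))
```

With pandas installed, the column names of a parquet file can be obtained via `pd.read_parquet(path).columns` and passed in as `columns`; an empty result means the file satisfies the requirement.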


## Training

We provide the implementation within the **open-source** verl framework; see `PSFT/verl/recipe/psft`.

## Evaluation

```
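cd evaluation
# start the model server with Ray Serve; aaa/bbb/ccc is a placeholder for your model path
serve run eval.llm:build_app model=aaa/bbb/ccc tensor-parallel-size=1

# open another terminal and run the evaluation against the served model
python eval/eval.py --temperature 0.7 --top_p 0.95 --max_tokens 10240 --model ccc --test_file eval/data/aime-2024.parquet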
```

