### To start
The code is modified from verl. To start, choose the corrsponinding model recipe (Qwen2.5-math-7B, Qwen3-14B, DS-Qwen-7B) and then run the corrsponinding scripts. 
The parameters for SLAT is set in verl/trainer/config/ppo_trainer.yaml