The training code is in the `src/open_r1` directory. Our code is based on the `open-r1-multimodal` repository.

The agent pipeline code is in the `agent` directory.

`run_sft.sh` runs SFT training.

`run_eval_multi_task_rl.sh` runs multi-task GRPO training.

Sample data is in the `data_sample` directory, and example generated slides are in the `demo` directory.