# Project Description

This code is developed based on the [MM-Eureka](https://github.com/ModalMinds/MM-EUREKA) repository. For environment setup and dependency installation, please refer to the original repository’s [README](https://github.com/ModalMinds/MM-EUREKA#readme).

## Running Scripts

You can use the following scripts to run training tasks using CPGD:

- Single Node Training

```shell
sh examples/scripts/train_cpgd_qwen_7b_single_node.sh
```

- Multi-Node Training

```shell
sh examples/scripts/train_cpgd_qwen_7b_multi_node.sh
```

**Estimated Training Time:** \~60 × 8 H100 GPU hour

## Reproducing GRPO Training Collapse

To reproduce the training collapse phenomenon observed with GRPO, run the following script:

```shell
sh examples/scripts/grpo_training_collapse.sh
```

**Estimated Training Time:** \~120 × 8 H100 GPU hour