When inside the math_grpo folder, to use the dataset if competition_math does not load, use 

https://huggingface.co/datasets/nlile/hendrycks-MATH-benchmark


if even this does not work for some reason, use
from datasets import load_from_disk

dataset = load_from_disk("./processed_math_train"). Similar for the test set.

Use train_efficient.py which has the GRPO code for training on MATH. Use eval2.py or run_matheval.py to run evaluations (or test your existing eval code and see if it gives the same results approximately)