Add datasets from our paper here.

We didn't add them as a commit since we already have the datasets elsewhere.

Add datasets as gpqa_main.csv, gsm8k.json, human-eval-v2.jsonl, and mmlu_test.json to this folder. We avoid this due to potential copyright issues.
