../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/math-500 2025-04-12 01:57:21 Final Accuracy: 53.4
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/minerva_math 2025-04-12 02:03:58 Final Accuracy: 32.7
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/gsm8k 2025-04-12 02:10:39 Final Accuracy: 85.3
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/olympiadbench 2025-04-12 02:30:07 Final Accuracy: 19.6
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/amc23 2025-04-12 02:34:33 Final Accuracy: 27.5
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/aime24 2025-04-12 02:39:14 Final Accuracy: 10.0
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/theoremqa 2025-04-12 03:04:10 Final Accuracy: 28.9
64gbs_gradAcc4_1epochs_1e6 gpqa  n_shot=5 2025-04-12 03:06:03 Final Accuracy: 0.3181818181818182
../eval_out_csd_packing_llama3p1_8B_Inst_dist_sft_wisub_ch1_v1_100_150k_64gbs_gradAcc4_1epochs_1e6/mmlu-pro 2025-04-12 03:38:00 Final Accuracy: 0.3508144946516943
