../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/math 2025-02-23 20:03:32 Final Accuracy: 36.3
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/minerva_math 2025-02-23 20:04:18 Final Accuracy: 18.0
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/gsm8k 2025-02-23 20:05:17 Final Accuracy: 85.9
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/olympiadbench 2025-02-23 20:06:53 Final Accuracy: 13.9
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/amc23 2025-02-23 20:07:12 Final Accuracy: 15.0
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/aime24 2025-02-23 20:07:31 Final Accuracy: 6.7
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/theoremqa 2025-02-23 20:08:52 Final Accuracy: 17.9
 gpqa  n_shot=0 2025-02-23 20:10:27 Final Accuracy: 0.2676767676767677
../eval_output_llama3_math_cot_CFT_packing_llama3p1_8B_Inst_llama3p3_70b_critique_WebInsSubChunk1/mmlu-pro 2025-02-23 20:28:19 Final Accuracy: 0.1997174201961671
