MathCAMPS Logo

CodeLlama 13B

Performance on individual Common Core standards, grouped by grade level.
IFUP Acc. = Incremental Followup Accuracy, CFUP Acc. = Counterfactual Followup Accuracy, Total FUPs Seen = Number of followup questions a model sees, since a model only sees followups if it answers the main question correctly.