model_name,thinking_mode,model_family,instruction_tuned,source,ARC-Challenge,Avg,BBH,GPQA,GSM8K,IFEval,MATH,MMLU,MMLU-Pro,MUSR,OpenBookQA,PIQA,SciQ,WinoGrande
Falcon-3-10B,Non-thinking,Falcon,No,Falcon Report (Base models),,27.59,41.38,12.75,,36.48,24.77,,36.0,14.17,,,,
Falcon-3-7B,Non-thinking,Falcon,No,Falcon Report (Base models),,24.72,31.56,12.86,,34.16,19.26,,32.34,18.14,,,,
Falcon-3-10B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Instruct models),,35.19,44.82,10.51,,78.17,25.91,,38.1,13.61,,,,
Falcon-3-7B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Instruct models),,34.91,37.92,8.05,,76.12,31.87,,34.3,21.17,,,,
Falcon-3-1B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Falcon-3-1B/3B Instruct),45.9,,39.0,26.5,38.6,54.4,1.0,43.9,18.6,35.1,40.0,72.0,86.8,60.2
Falcon-3-3B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Falcon-3-1B/3B Instruct),58.5,,45.4,29.6,71.9,68.3,19.9,55.7,29.7,40.2,42.2,74.4,95.6,65.0
Falcon-3-7B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Falcon-3-7B/10B Instruct detailed),65.9,,52.4,32.0,79.1,76.5,29.4,68.0,40.7,46.4,45.8,78.8,94.7,70.4
Falcon-3-10B-Instruct,Non-thinking,Falcon,Yes,Falcon Report (Falcon-3-7B/10B Instruct detailed),64.5,,58.4,33.5,83.1,78.0,22.1,71.6,44.0,41.1,48.2,78.4,90.4,71.0
