Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,505.0,6,505.0,85.08,83.33,Silver,15,85.71428571428571,1,84.17
grok-4-fast-reasoning,505.0,6,505.0,84.88,83.33,Silver,15,85.71428571428571,2,84.17
gemini-2.5-pro,426.0,6,426.0,84.98,50.0,Silver,23,77.55102040816327,2,71.0
gpt-oss-120b-high,419.0,6,419.0,75.64,66.67,Silver,24,76.53061224489795,3,69.83
gpt-o3-mini-high,382.0,6,382.0,70.55,50.0,Bronze,27,73.46938775510205,4,63.67
gpt-oss-120b_sp4,372.0,6,372.0,64.2,50.0,Bronze,28,72.44897959183673,6,62.0
gpt-oss-20b-high,372.0,6,372.0,63.25,50.0,Bronze,28,72.44897959183673,5,62.0
gpt-oss-120b-medium,372.0,6,372.0,65.15,50.0,Bronze,28,72.44897959183673,6,62.0
gemini-2.5-flash,353.0,6,353.0,65.65,50.0,Bronze,29,71.42857142857143,7,58.83
gpt-oss-120b_sp2,319.0,6,319.0,57.69,50.0,Bronze,31,69.38775510204081,10,53.17
gpt-oss-120b_sp3,319.0,6,319.0,57.69,50.0,Bronze,31,69.38775510204081,11,53.17
gpt-oss-20b-medium,319.0,6,319.0,57.69,50.0,Bronze,31,69.38775510204081,8,53.17
gpt-oss-120b-low,319.0,6,319.0,58.17,50.0,Bronze,31,69.38775510204081,9,53.17
gpt-oss-120b_sp1,319.0,6,319.0,59.6,50.0,Bronze,31,69.38775510204081,14,53.17
seed-oss_-1,311.0,6,311.0,55.56,50.0,Bronze,32,68.36734693877551,10,51.83
claude-sonnet-4.5,254.0,6,254.0,58.02,16.67,Bronze,47,53.06122448979592,16,42.33
seed-oss_8192,211.0,6,211.0,37.18,33.33,Bronze,51,48.97959183673469,11,35.17
gpt-oss-20b-low,208.0,6,208.0,38.46,33.33,Bronze,51,48.97959183673469,12,34.67
gpt-4.1,189.0,6,189.0,40.17,16.67,None,56,43.87755102040816,13,31.5
Qwen3-32B,172.0,6,172.0,43.79,16.67,None,56,43.87755102040816,14,28.67
deepseek-reasoner,171.0,6,171.0,43.19,16.67,None,56,43.87755102040816,15,28.5
Qwen3-14B,160.0,6,160.0,38.04,16.67,None,63,36.734693877551024,16,26.67
deepseek-chat,158.0,6,158.0,37.77,16.67,None,64,35.714285714285715,17,26.33
Qwen3-30B,158.0,6,158.0,39.56,16.67,None,64,35.714285714285715,18,26.33
Qwen3-4B,146.0,6,146.0,32.25,16.67,None,66,33.673469387755105,19,24.33
OpenCodeReasoning-Nemotron-32B-IOI,145.0,6,145.0,34.67,16.67,None,66,33.673469387755105,26,24.17
DeepSeek-R1-Distill-Qwen-32B,130.0,6,130.0,28.87,16.67,None,68,31.632653061224488,20,21.67
OlympicCoder-7B,128.0,6,128.0,32.21,16.67,None,70,29.591836734693878,28,21.33
DeepSeek-R1-Distill-Llama-70B,123.0,6,123.0,31.38,16.67,None,71,28.571428571428573,21,20.5
seed-oss_4096,119.0,6,119.0,25.46,16.67,None,72,27.551020408163264,22,19.83
seed-oss_2048,111.0,6,111.0,24.66,16.67,None,72,27.551020408163264,23,18.5
seed-oss_1024,111.0,6,111.0,22.97,16.67,None,72,27.551020408163264,24,18.5
QwQ-32B,111.0,6,111.0,25.67,16.67,None,72,27.551020408163264,25,18.5
Qwen3-8B,111.0,6,111.0,27.83,16.67,None,72,27.551020408163264,26,18.5
Qwen3-32B-Non-Thinking,108.0,6,108.0,22.18,16.67,None,72,27.551020408163264,27,18.0
Llama-3.3-70B-Instruct,82.0,6,82.0,20.53,0.0,None,79,20.408163265306122,28,13.67
Qwen2.5-72B,78.0,6,78.0,19.92,0.0,None,82,17.346938775510203,29,13.0
Qwen2.5-Coder-14B-Instruct,68.0,6,68.0,18.67,0.0,None,86,13.26530612244898,30,11.33
Qwen3-14B-Non-Thinking,66.0,6,66.0,17.13,0.0,None,86,13.26530612244898,31,11.0
Mistral-Large-Instruct-2411,29.0,6,29.0,15.39,0.0,None,91,8.16326530612245,34,4.83
DeepSeek-R1-Distill-Llama-8B,24.0,6,24.0,8.91,0.0,None,93,6.122448979591836,35,4.0
Qwen3-8B-Non-Thinking,24.0,6,24.0,13.32,0.0,None,93,6.122448979591836,37,4.0
seed-oss_512,38.0,6,38.0,9.89,0.0,None,91,8.16326530612245,32,6.33
Codestral-22B-v0.1,7.0,6,7.0,8.91,0.0,None,96,3.061224489795918,41,1.17
DeepSeek-R1-Distill-Qwen-14B,35.0,6,35.0,9.99,0.0,None,91,8.16326530612245,33,5.83
Mistral-Small-3.1-24B-2503,24.0,6,24.0,12.48,0.0,None,93,6.122448979591836,36,4.0
Qwen2.5-Coder-7B-Instruct,7.0,6,7.0,9.45,0.0,None,96,3.061224489795918,43,1.17
Qwen3-4B-Non-Thinking,24.0,6,24.0,12.55,0.0,None,93,6.122448979591836,38,4.0
DeepSeek-Coder-V2-Lite-Instruct,7.0,6,7.0,11.9,0.0,None,96,3.061224489795918,44,1.17
Qwen2.5-Coder-32B-Instruct,18.0,6,18.0,13.39,0.0,None,93,6.122448979591836,39,3.0
seed-oss_0,11.0,6,11.0,8.4,0.0,None,96,3.061224489795918,40,1.83
Llama-4-Scout,7.0,6,7.0,6.25,0.0,None,96,3.061224489795918,42,1.17
seed-oss_16384,0.0,6,0.0,0.0,0.0,None,96,3.061224489795918,45,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,96,3.061224489795918,46,0.0
Llama-3.1-8B-Instruct,0.0,6,0.0,3.45,0.0,None,96,3.061224489795918,47,0.0
Qwen3-30B-Non-Thinking,0.0,6,0.0,5.18,0.0,None,96,3.061224489795918,48,0.0
