Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,209.0,6,209.0,49.26,16.67,Bronze,34,57.69230769230769,1,34.83
grok-4-fast-reasoning,200.0,6,200.0,44.0,16.67,Bronze,37,53.84615384615385,2,33.33
claude-sonnet-4.5,195.0,6,195.0,51.69,16.67,Bronze,38,52.56410256410256,3,32.5
gpt-o3-mini-high,132.0,6,132.0,30.26,16.67,None,55,30.76923076923077,3,22.0
gemini-2.5-pro,121.0,6,121.0,34.64,0.0,None,60,24.358974358974358,4,20.17
gpt-oss-120b-high,100.0,6,100.0,19.61,16.67,None,66,16.666666666666668,5,16.67
deepseek-reasoner,100.0,6,100.0,24.36,16.67,None,66,16.666666666666668,6,16.67
gpt-oss-20b-high,93.0,6,93.0,28.88,0.0,None,68,14.102564102564102,7,15.5
gemini-2.5-flash,87.0,6,87.0,25.83,0.0,None,68,14.102564102564102,8,14.5
gpt-oss-20b-medium,83.0,6,83.0,23.18,0.0,None,68,14.102564102564102,9,13.83
gpt-oss-120b_sp1,73.0,6,73.0,20.14,0.0,None,70,11.538461538461538,11,12.17
gpt-oss-120b-medium,72.0,6,72.0,22.19,0.0,None,71,10.256410256410257,10,12.0
OlympicCoder-7B,51.0,6,51.0,16.37,0.0,None,75,5.128205128205129,13,8.5
gpt-oss-120b_sp4,51.0,6,51.0,14.0,0.0,None,75,5.128205128205129,14,8.5
gpt-oss-120b_sp2,51.0,6,51.0,14.0,0.0,None,75,5.128205128205129,15,8.5
gpt-4.1,47.0,6,47.0,20.95,0.0,None,76,3.8461538461538463,11,7.83
gpt-oss-120b-low,32.0,6,32.0,11.42,0.0,None,76,3.8461538461538463,12,5.33
gpt-oss-120b_sp3,32.0,6,32.0,12.4,0.0,None,76,3.8461538461538463,18,5.33
seed-oss_-1,21.0,6,21.0,12.79,0.0,None,77,2.5641025641025643,13,3.5
Qwen2.5-Coder-32B-Instruct,14.0,6,14.0,8.25,0.0,None,78,1.2820512820512822,14,2.33
Qwen3-4B,12.0,6,12.0,6.31,0.0,None,79,0.0,15,2.0
Qwen3-30B,12.0,6,12.0,7.74,0.0,None,79,0.0,16,2.0
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,79,0.0,18,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,0.55,0.0,None,79,0.0,19,0.0
Llama-4-Scout,0.0,6,0.0,1.54,0.0,None,79,0.0,20,0.0
gpt-oss-20b-low,0.0,6,0.0,3.45,0.0,None,79,0.0,21,0.0
Qwen2.5-Coder-14B-Instruct,0.0,6,0.0,3.58,0.0,None,79,0.0,22,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,6,0.0,1.82,0.0,None,79,0.0,26,0.0
deepseek-chat,0.0,6,0.0,6.58,0.0,None,79,0.0,24,0.0
Llama-3.1-8B-Instruct,0.0,6,0.0,1.64,0.0,None,79,0.0,27,0.0
Qwen2.5-72B,0.0,6,0.0,1.82,0.0,None,79,0.0,28,0.0
Qwen3-14B-Non-Thinking,0.0,6,0.0,1.54,0.0,None,79,0.0,29,0.0
QwQ-32B,0.0,6,0.0,7.48,0.0,None,79,0.0,31,0.0
Qwen3-14B,0.0,6,0.0,4.54,0.0,None,79,0.0,30,0.0
Mistral-Small-3.1-24B-2503,0.0,6,0.0,1.82,0.0,None,79,0.0,32,0.0
Codestral-22B-v0.1,0.0,6,0.0,1.54,0.0,None,79,0.0,34,0.0
Qwen3-4B-Non-Thinking,0.0,6,0.0,1.83,0.0,None,79,0.0,38,0.0
Qwen3-32B-Non-Thinking,0.0,6,0.0,5.33,0.0,None,79,0.0,36,0.0
Qwen3-8B-Non-Thinking,0.0,6,0.0,3.53,0.0,None,79,0.0,37,0.0
DeepSeek-R1-Distill-Qwen-32B,0.0,6,0.0,6.69,0.0,None,79,0.0,39,0.0
Qwen3-30B-Non-Thinking,0.0,6,0.0,4.47,0.0,None,79,0.0,41,0.0
DeepSeek-R1-Distill-Llama-70B,0.0,6,0.0,3.31,0.0,None,79,0.0,40,0.0
OpenCodeReasoning-Nemotron-32B-IOI,0.0,6,0.0,7.22,0.0,None,79,0.0,43,0.0
Qwen2.5-Coder-7B-Instruct,0.0,6,0.0,1.11,0.0,None,79,0.0,42,0.0
Llama-3.3-70B-Instruct,0.0,6,0.0,1.76,0.0,None,79,0.0,43,0.0
Qwen3-8B,0.0,6,0.0,6.3,0.0,None,79,0.0,45,0.0
Qwen3-32B,0.0,6,0.0,9.62,0.0,None,79,0.0,44,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,6,0.0,4.59,0.0,None,79,0.0,46,0.0
Mistral-Large-Instruct-2411,0.0,6,0.0,4.26,0.0,None,79,0.0,47,0.0
