Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,185.0,5,185.0,43.68,16.67,Bronze,16,68.75,1,37.0
gemini-2.5-pro,180.0,5,180.0,42.38,16.67,Bronze,16,68.75,2,36.0
grok-4-fast-reasoning,150.0,5,150.0,49.08,20.0,Bronze,19,62.5,3,30.0
gpt-oss-120b-high,130.0,5,130.0,37.42,0.0,Bronze,24,52.083333333333336,3,26.0
gpt-oss-120b_sp1,60.0,5,60.0,18.84,0.0,None,33,33.333333333333336,5,12.0
gemini-2.5-flash,45.0,5,45.0,23.86,0.0,None,36,27.083333333333332,4,9.0
gpt-oss-120b_sp3,40.0,5,40.0,11.65,0.0,None,36,27.083333333333332,7,8.0
gpt-oss-120b-medium,40.0,5,40.0,12.5,0.0,None,36,27.083333333333332,5,8.0
claude-sonnet-4.5,40.0,5,40.0,24.68,0.0,None,36,27.083333333333332,9,8.0
gpt-o3-mini-high,30.0,5,30.0,10.19,0.0,None,39,20.833333333333332,6,6.0
seed-oss_-1,20.0,5,20.0,14.24,0.0,None,43,12.5,7,4.0
gpt-oss-120b_sp2,20.0,5,20.0,7.34,0.0,None,43,12.5,12,4.0
gpt-oss-20b-high,15.0,5,15.0,13.22,0.0,None,45,8.333333333333334,8,3.0
QwQ-32B,10.0,5,10.0,2.77,0.0,None,45,8.333333333333334,12,2.0
Qwen3-32B-Non-Thinking,10.0,5,10.0,1.68,0.0,None,45,8.333333333333334,9,2.0
gpt-oss-120b_sp4,10.0,5,10.0,2.15,0.0,None,45,8.333333333333334,16,2.0
gpt-oss-20b-medium,10.0,5,10.0,2.09,0.0,None,45,8.333333333333334,13,2.0
deepseek-reasoner,10.0,5,10.0,5.97,0.0,None,45,8.333333333333334,15,2.0
gpt-4.1,10.0,5,10.0,7.09,0.0,None,45,8.333333333333334,16,2.0
deepseek-chat,10.0,5,10.0,9.35,0.0,None,45,8.333333333333334,17,2.0
DeepSeek-R1-Distill-Qwen-32B,0.0,5,0.0,0.44,0.0,None,49,0.0,41,0.0
Llama-4-Scout,0.0,5,0.0,0.0,0.0,None,49,0.0,19,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,5,0.0,0.67,0.0,None,49,0.0,40,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,5,0.0,0.0,0.0,None,49,0.0,18,0.0
Llama-3.1-8B-Instruct,0.0,5,0.0,0.22,0.0,None,49,0.0,20,0.0
Qwen3-8B-Non-Thinking,0.0,5,0.0,0.22,0.0,None,49,0.0,26,0.0
Qwen3-14B-Non-Thinking,0.0,5,0.0,0.44,0.0,None,49,0.0,30,0.0
OlympicCoder-7B,0.0,5,0.0,0.67,0.0,None,49,0.0,28,0.0
Qwen2.5-Coder-32B-Instruct,0.0,5,0.0,0.69,0.0,None,49,0.0,33,0.0
OpenCodeReasoning-Nemotron-32B-IOI,0.0,5,0.0,1.41,0.0,None,49,0.0,30,0.0
Qwen2.5-Coder-14B-Instruct,0.0,5,0.0,0.22,0.0,None,49,0.0,29,0.0
Qwen3-4B,0.0,5,0.0,0.57,0.0,None,49,0.0,25,0.0
gpt-oss-20b-low,0.0,5,0.0,0.81,0.0,None,49,0.0,21,0.0
Qwen3-8B,0.0,5,0.0,1.53,0.0,None,49,0.0,32,0.0
Mistral-Small-3.1-24B-2503,0.0,5,0.0,0.44,0.0,None,49,0.0,35,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,5,0.0,0.27,0.0,None,49,0.0,37,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,5,0.0,1.56,0.0,None,49,0.0,38,0.0
Mistral-Large-Instruct-2411,0.0,5,0.0,0.69,0.0,None,49,0.0,39,0.0
Qwen2.5-Coder-7B-Instruct,0.0,5,0.0,0.79,0.0,None,49,0.0,42,0.0
DeepSeek-R1-Distill-Llama-70B,0.0,5,0.0,0.91,0.0,None,49,0.0,43,0.0
gpt-oss-120b-low,0.0,5,0.0,5.75,0.0,None,49,0.0,44,0.0
Codestral-22B-v0.1,0.0,5,0.0,0.44,0.0,None,49,0.0,45,0.0
Qwen3-4B-Non-Thinking,0.0,5,0.0,0.57,0.0,None,49,0.0,46,0.0
Qwen3-30B-Non-Thinking,0.0,5,0.0,4.33,0.0,None,49,0.0,48,0.0
