Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,206.0,3,206.0,78.86,66.67,Silver,15,92.3913043478261,1,68.67
gpt-oss-120b-high,108.0,3,108.0,63.75,33.33,Bronze,58,69.02173913043478,2,36.0
gpt-oss-20b-high,106.0,3,106.0,45.52,33.33,Bronze,58,69.02173913043478,3,35.33
grok-4-fast-reasoning,106.0,3,106.0,51.65,33.33,Bronze,58,69.02173913043478,4,35.33
gpt-oss-120b_sp2,106.0,3,106.0,47.75,33.33,Bronze,58,69.02173913043478,5,35.33
gemini-2.5-pro,47.0,3,47.0,59.0,0.0,None,102,45.108695652173914,4,15.67
seed-oss_8192,28.0,3,28.0,34.32,0.0,None,125,32.608695652173914,8,9.33
deepseek-chat,28.0,3,28.0,42.26,0.0,None,125,32.608695652173914,5,9.33
seed-oss_-1,28.0,3,28.0,48.6,0.0,None,125,32.608695652173914,7,9.33
Qwen3-30B,28.0,3,28.0,43.05,0.0,None,125,32.608695652173914,9,9.33
Qwen3-32B,28.0,3,28.0,43.05,0.0,None,125,32.608695652173914,6,9.33
Qwen3-14B,28.0,3,28.0,43.05,0.0,None,125,32.608695652173914,10,9.33
gpt-oss-120b_sp3,26.0,3,26.0,34.9,0.0,None,127,31.52173913043478,13,8.67
seed-oss_16384,26.0,3,26.0,29.19,0.0,None,127,31.52173913043478,12,8.67
seed-oss_2048,26.0,3,26.0,27.81,0.0,None,127,31.52173913043478,15,8.67
gpt-oss-120b_sp1,26.0,3,26.0,33.21,0.0,None,127,31.52173913043478,16,8.67
gpt-o3-mini-high,26.0,3,26.0,33.56,0.0,None,127,31.52173913043478,11,8.67
Qwen3-8B,26.0,3,26.0,25.59,0.0,None,127,31.52173913043478,13,8.67
OpenCodeReasoning-Nemotron-32B-IOI,26.0,3,26.0,25.59,0.0,None,127,31.52173913043478,19,8.67
gpt-oss-120b-medium,26.0,3,26.0,31.71,0.0,None,127,31.52173913043478,14,8.67
gpt-oss-120b-low,26.0,3,26.0,34.9,0.0,None,127,31.52173913043478,16,8.67
gpt-oss-120b_sp4,26.0,3,26.0,32.48,0.0,None,127,31.52173913043478,22,8.67
gpt-oss-20b-medium,26.0,3,26.0,39.15,0.0,None,127,31.52173913043478,17,8.67
claude-sonnet-4.5,22.0,3,22.0,36.75,0.0,None,129,30.434782608695652,24,7.33
Qwen3-14B-Non-Thinking,21.0,3,21.0,33.26,0.0,None,142,23.369565217391305,18,7.0
Llama-3.3-70B-Instruct,21.0,3,21.0,33.26,0.0,None,142,23.369565217391305,19,7.0
gpt-oss-20b-low,20.0,3,20.0,20.35,0.0,None,154,16.847826086956523,23,6.67
gpt-4.1,20.0,3,20.0,26.47,0.0,None,154,16.847826086956523,21,6.67
DeepSeek-R1-Distill-Qwen-32B,20.0,3,20.0,18.0,0.0,None,154,16.847826086956523,22,6.67
gemini-2.5-flash,20.0,3,20.0,33.56,0.0,None,154,16.847826086956523,20,6.67
seed-oss_4096,6.0,3,6.0,14.41,0.0,None,179,3.260869565217391,25,2.0
Qwen3-4B,6.0,3,6.0,12.19,0.0,None,179,3.260869565217391,24,2.0
DeepSeek-R1-Distill-Qwen-14B,6.0,3,6.0,14.41,0.0,None,179,3.260869565217391,26,2.0
deepseek-reasoner,6.0,3,6.0,18.31,0.0,None,179,3.260869565217391,27,2.0
QwQ-32B,6.0,3,6.0,20.09,0.0,None,179,3.260869565217391,28,2.0
Llama-4-Scout,2.0,3,2.0,24.41,0.0,None,183,1.0869565217391304,29,0.67
Qwen3-30B-Non-Thinking,2.0,3,2.0,25.18,0.0,None,183,1.0869565217391304,30,0.67
DeepSeek-Coder-V2-Lite-Instruct,0.0,3,0.0,6.95,0.0,None,185,0.0,31,0.0
seed-oss_0,0.0,3,0.0,7.67,0.0,None,185,0.0,35,0.0
seed-oss_512,0.0,3,0.0,9.89,0.0,None,185,0.0,36,0.0
Qwen2.5-72B,0.0,3,0.0,13.07,0.0,None,185,0.0,37,0.0
Codestral-22B-v0.1,0.0,3,0.0,9.17,0.0,None,185,0.0,38,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,3,0.0,6.95,0.0,None,185,0.0,32,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,3,0.0,6.95,0.0,None,185,0.0,40,0.0
Mistral-Small-3.1-24B-2503,0.0,3,0.0,11.2,0.0,None,185,0.0,41,0.0
OlympicCoder-7B,0.0,3,0.0,6.95,0.0,None,185,0.0,46,0.0
Qwen3-8B-Non-Thinking,0.0,3,0.0,0.0,0.0,None,185,0.0,34,0.0
DeepSeek-R1-Distill-Llama-70B,0.0,3,0.0,9.17,0.0,None,185,0.0,39,0.0
Qwen2.5-Coder-32B-Instruct,0.0,3,0.0,9.17,0.0,None,185,0.0,42,0.0
Qwen3-32B-Non-Thinking,0.0,3,0.0,0.0,0.0,None,185,0.0,33,0.0
Qwen3-4B-Non-Thinking,0.0,3,0.0,11.74,0.0,None,185,0.0,43,0.0
Llama-3.1-8B-Instruct,0.0,3,0.0,8.84,0.0,None,185,0.0,44,0.0
Mistral-Large-Instruct-2411,0.0,3,0.0,10.67,0.0,None,185,0.0,45,0.0
Qwen2.5-Coder-14B-Instruct,0.0,3,0.0,17.19,0.0,None,185,0.0,46,0.0
seed-oss_1024,0.0,3,0.0,11.39,0.0,None,185,0.0,48,0.0
Qwen2.5-Coder-7B-Instruct,0.0,3,0.0,10.31,0.0,None,185,0.0,47,0.0
