Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,161.0,3,161.0,75.19,33.33,Silver,8,87.93103448275862,1,53.67
grok-4-fast-reasoning,154.0,3,154.0,70.42,33.33,Silver,8,87.93103448275862,2,51.33
claude-sonnet-4.5,154.0,3,154.0,70.42,33.33,Silver,8,87.93103448275862,3,51.33
gemini-2.5-pro,154.0,3,154.0,70.42,33.33,Silver,8,87.93103448275862,2,51.33
gpt-oss-120b-high,149.0,3,149.0,68.81,33.33,Silver,8,87.93103448275862,3,49.67
gpt-o3-mini-high,139.0,3,139.0,67.05,33.33,Silver,9,86.20689655172414,4,46.33
gemini-2.5-flash,137.0,3,137.0,63.75,33.33,Silver,9,86.20689655172414,5,45.67
gpt-oss-120b_sp3,129.0,3,129.0,66.45,33.33,Silver,11,82.75862068965517,8,43.0
gpt-4.1,125.0,3,125.0,58.71,33.33,Silver,11,82.75862068965517,6,41.67
gpt-oss-120b_sp1,117.0,3,117.0,47.34,33.33,Silver,13,79.3103448275862,10,39.0
gpt-oss-120b_sp2,112.0,3,112.0,54.89,33.33,Bronze,17,72.41379310344827,11,37.33
seed-oss_8192,109.0,3,109.0,53.0,33.33,Bronze,19,68.96551724137932,7,36.33
seed-oss_-1,109.0,3,109.0,53.0,33.33,Bronze,19,68.96551724137932,8,36.33
gpt-oss-20b-high,109.0,3,109.0,46.31,33.33,Bronze,19,68.96551724137932,9,36.33
gpt-oss-120b_sp4,109.0,3,109.0,46.31,33.33,Bronze,19,68.96551724137932,15,36.33
gpt-oss-20b-medium,105.0,3,105.0,36.25,33.33,Bronze,22,63.793103448275865,10,35.0
gpt-oss-120b-medium,104.0,3,104.0,45.58,33.33,Bronze,22,63.793103448275865,11,34.67
gpt-oss-120b-low,104.0,3,104.0,52.27,33.33,Bronze,22,63.793103448275865,12,34.67
seed-oss_4096,100.0,3,100.0,33.33,33.33,Bronze,24,60.3448275862069,13,33.33
seed-oss_16384,100.0,3,100.0,35.29,33.33,Bronze,24,60.3448275862069,15,33.33
gpt-oss-20b-low,100.0,3,100.0,35.85,33.33,Bronze,24,60.3448275862069,14,33.33
Qwen3-14B,62.0,3,62.0,53.71,0.0,Bronze,26,56.89655172413793,16,20.67
Qwen3-30B,55.0,3,55.0,47.81,0.0,Bronze,27,55.172413793103445,17,18.33
DeepSeek-R1-Distill-Llama-70B,43.0,3,43.0,45.61,0.0,None,31,48.275862068965516,18,14.33
Qwen2.5-72B,35.0,3,35.0,27.84,0.0,None,34,43.10344827586207,19,11.67
Qwen3-32B,32.0,3,32.0,39.66,0.0,None,34,43.10344827586207,20,10.67
Llama-4-Scout,23.0,3,23.0,30.05,0.0,None,37,37.93103448275862,21,7.67
Qwen3-8B,23.0,3,23.0,27.34,0.0,None,37,37.93103448275862,22,7.67
deepseek-reasoner,17.0,3,17.0,36.02,0.0,None,41,31.03448275862069,23,5.67
QwQ-32B,9.0,3,9.0,31.33,0.0,None,42,29.310344827586206,25,3.0
Qwen3-4B,9.0,3,9.0,18.53,0.0,None,42,29.310344827586206,24,3.0
OlympicCoder-7B,9.0,3,9.0,28.44,0.0,None,42,29.310344827586206,32,3.0
Qwen2.5-Coder-14B-Instruct,5.0,3,5.0,12.23,0.0,None,42,29.310344827586206,26,1.67
seed-oss_2048,4.0,3,4.0,21.52,0.0,None,43,27.586206896551722,28,1.33
seed-oss_512,4.0,3,4.0,26.35,0.0,None,43,27.586206896551722,29,1.33
deepseek-chat,4.0,3,4.0,24.03,0.0,None,43,27.586206896551722,27,1.33
DeepSeek-R1-Distill-Qwen-32B,4.0,3,4.0,22.18,0.0,None,43,27.586206896551722,30,1.33
Qwen3-30B-Non-Thinking,4.0,3,4.0,18.37,0.0,None,43,27.586206896551722,31,1.33
Llama-3.3-70B-Instruct,4.0,3,4.0,24.49,0.0,None,43,27.586206896551722,32,1.33
Codestral-22B-v0.1,0.0,3,0.0,0.0,0.0,None,45,24.137931034482758,33,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,3,0.0,0.0,0.0,None,45,24.137931034482758,34,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,3,0.0,0.0,0.0,None,45,24.137931034482758,35,0.0
seed-oss_0,0.0,3,0.0,6.21,0.0,None,45,24.137931034482758,36,0.0
seed-oss_1024,0.0,3,0.0,12.9,0.0,None,45,24.137931034482758,37,0.0
Qwen3-8B-Non-Thinking,0.0,3,0.0,0.0,0.0,None,45,24.137931034482758,38,0.0
Llama-3.1-8B-Instruct,0.0,3,0.0,0.65,0.0,None,45,24.137931034482758,39,0.0
Qwen3-4B-Non-Thinking,0.0,3,0.0,7.34,0.0,None,45,24.137931034482758,41,0.0
Mistral-Small-3.1-24B-2503,0.0,3,0.0,12.9,0.0,None,45,24.137931034482758,42,0.0
Mistral-Large-Instruct-2411,0.0,3,0.0,12.34,0.0,None,45,24.137931034482758,45,0.0
Qwen2.5-Coder-32B-Instruct,0.0,3,0.0,7.34,0.0,None,45,24.137931034482758,43,0.0
Qwen3-32B-Non-Thinking,0.0,3,0.0,12.9,0.0,None,45,24.137931034482758,40,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,3,0.0,0.65,0.0,None,45,24.137931034482758,44,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,3,0.0,8.07,0.0,None,45,24.137931034482758,46,0.0
Qwen3-14B-Non-Thinking,0.0,3,0.0,5.56,0.0,None,45,24.137931034482758,47,0.0
OpenCodeReasoning-Nemotron-32B-IOI,0.0,3,0.0,16.67,0.0,None,45,24.137931034482758,55,0.0
