Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-20b-high,450.0,5,450.0,100.0,100.0,Gold,1,100.0,1,90.0
gemini-2.5-pro,450.0,5,450.0,100.0,100.0,Gold,1,100.0,2,90.0
gemini-2.5-flash,450.0,5,450.0,100.0,100.0,Gold,1,100.0,3,90.0
gpt-o3-mini-high,450.0,5,450.0,100.0,100.0,Gold,1,100.0,4,90.0
grok-4-fast-reasoning,450.0,5,450.0,100.0,100.0,Gold,1,100.0,5,90.0
gpt-oss-20b-medium,415.0,5,415.0,88.0,80.0,Gold,15,94.44444444444444,5,83.0
gpt-oss-120b_sp1,400.0,5,400.0,100.0,100.0,Gold,15,94.44444444444444,7,80.0
gpt-oss-120b-high,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,7,80.0
gpt-oss-120b_sp4,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,9,80.0
gpt-5,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,8,80.0
gpt-oss-120b-medium,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,9,80.0
seed-oss_-1,400.0,5,400.0,100.0,100.0,Gold,15,94.44444444444444,6,80.0
gpt-oss-120b_sp3,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,13,80.0
gpt-oss-120b_sp2,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,14,80.0
gpt-oss-120b-low,400.0,5,400.0,80.0,80.0,Gold,15,94.44444444444444,10,80.0
claude-sonnet-4.5,326.0,5,326.0,77.44,60.0,Silver,34,86.9047619047619,16,65.2
gpt-4.1,276.0,5,276.0,57.44,40.0,Silver,45,82.53968253968254,11,55.2
seed-oss_8192,270.0,5,270.0,52.78,40.0,Silver,45,82.53968253968254,12,54.0
gpt-oss-20b-low,260.0,5,260.0,68.0,60.0,Silver,45,82.53968253968254,13,52.0
deepseek-reasoner,255.0,5,255.0,48.0,40.0,Silver,55,78.57142857142857,14,51.0
Qwen3-30B,255.0,5,255.0,48.0,40.0,Silver,55,78.57142857142857,15,51.0
OpenCodeReasoning-Nemotron-32B-IOI,220.0,5,220.0,49.33,40.0,Bronze,73,71.42857142857143,22,44.0
deepseek-chat,211.0,5,211.0,45.44,20.0,Bronze,78,69.44444444444444,16,42.2
Qwen3-32B,196.0,5,196.0,48.11,20.0,Bronze,83,67.46031746031746,17,39.2
seed-oss_4096,180.0,5,180.0,40.0,40.0,Bronze,91,64.28571428571429,18,36.0
Qwen3-8B,175.0,5,175.0,38.67,20.0,Bronze,91,64.28571428571429,19,35.0
QwQ-32B,156.0,5,156.0,33.33,0.0,Bronze,97,61.904761904761905,20,31.2
Qwen3-14B,145.0,5,145.0,33.33,20.0,Bronze,102,59.92063492063492,21,29.0
seed-oss_1024,110.0,5,110.0,21.33,20.0,None,127,50.0,22,22.0
DeepSeek-R1-Distill-Qwen-32B,109.0,5,109.0,25.6,0.0,None,129,49.20634920634921,23,21.8
Qwen2.5-Coder-32B-Instruct,95.0,5,95.0,18.67,0.0,None,135,46.82539682539682,24,19.0
Qwen3-4B,80.0,5,80.0,17.33,0.0,None,164,35.317460317460316,25,16.0
DeepSeek-R1-Distill-Llama-70B,80.0,5,80.0,21.33,0.0,None,164,35.317460317460316,26,16.0
Qwen2.5-72B,69.0,5,69.0,18.33,0.0,None,172,32.142857142857146,27,13.8
Qwen2.5-Coder-14B-Instruct,65.0,5,65.0,13.33,0.0,None,173,31.746031746031747,28,13.0
Qwen3-30B-Non-Thinking,46.0,5,46.0,8.0,0.0,None,198,21.825396825396826,29,9.2
DeepSeek-R1-Distill-Qwen-14B,45.0,5,45.0,8.0,0.0,None,198,21.825396825396826,30,9.0
Qwen3-8B-Non-Thinking,39.0,5,39.0,8.0,0.0,None,201,20.634920634920636,32,7.8
Qwen3-32B-Non-Thinking,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,43,0.0
Mistral-Large-Instruct-2411,39.0,5,39.0,9.33,0.0,None,201,20.634920634920636,31,7.8
Qwen2.5-Coder-7B-Instruct,39.0,5,39.0,8.0,0.0,None,201,20.634920634920636,33,7.8
Codestral-22B-v0.1,39.0,5,39.0,9.33,0.0,None,201,20.634920634920636,34,7.8
Llama-3.3-70B-Instruct,39.0,5,39.0,12.0,0.0,None,201,20.634920634920636,35,7.8
Mistral-Small-3.1-24B-2503,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,42,0.0
Qwen3-14B-Non-Thinking,20.0,5,20.0,6.67,0.0,None,216,14.682539682539682,36,4.0
Llama-4-Scout,19.0,5,19.0,4.0,0.0,None,217,14.285714285714286,37,3.8
DeepSeek-Coder-V2-Lite-Instruct,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,44,0.0
Qwen3-4B-Non-Thinking,15.0,5,15.0,4.0,0.0,None,235,7.142857142857143,38,3.0
OlympicCoder-7B,15.0,5,15.0,8.0,0.0,None,235,7.142857142857143,49,3.0
seed-oss_16384,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,39,0.0
seed-oss_2048,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,40,0.0
seed-oss_512,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,41,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,45,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,46,0.0
Llama-3.1-8B-Instruct,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,47,0.0
seed-oss_0,0.0,5,0.0,0.0,0.0,None,242,4.365079365079365,48,0.0
