Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-120b-high,343.0,4,330.0,85.19,80.0,Gold,1,100.0,1,82.5
gpt-oss-20b-high,343.0,4,330.0,85.0,80.0,Gold,1,100.0,2,82.5
seed-oss_-1,343.0,4,330.0,85.19,80.0,Gold,1,100.0,3,82.5
gpt-oss-20b-medium,343.0,4,330.0,85.19,80.0,Gold,1,100.0,4,82.5
gpt-5,343.0,4,330.0,85.19,80.0,Gold,1,100.0,5,82.5
grok-4-fast-reasoning,330.0,4,330.0,100.0,100.0,Gold,1,100.0,6,82.5
gpt-oss-120b_sp1,330.0,4,317.0,81.94,60.0,Gold,2,99.31972789115646,7,79.25
gpt-oss-120b_sp3,300.0,4,287.0,83.33,60.0,Gold,2,99.31972789115646,8,71.75
gemini-2.5-pro,300.0,4,287.0,80.08,60.0,Gold,2,99.31972789115646,6,71.75
gpt-oss-120b_sp2,300.0,4,287.0,80.08,60.0,Gold,2,99.31972789115646,10,71.75
gpt-o3-mini-high,300.0,4,287.0,80.08,60.0,Gold,2,99.31972789115646,7,71.75
gpt-oss-120b-medium,287.0,4,287.0,74.88,60.0,Gold,2,99.31972789115646,8,71.75
gpt-oss-120b_sp4,291.0,4,278.0,75.7,60.0,Gold,3,98.63945578231292,13,69.5
seed-oss_16384,280.0,4,267.0,76.47,40.0,Gold,3,98.63945578231292,9,66.75
Qwen3-32B,265.0,4,265.0,67.44,60.0,Gold,3,98.63945578231292,11,66.25
gemini-2.5-flash,272.0,4,259.0,72.73,40.0,Gold,4,97.95918367346938,12,64.75
claude-sonnet-4.5,242.0,4,242.0,81.72,50.0,Gold,6,96.59863945578232,17,60.5
Qwen3-30B,253.0,4,240.0,66.2,40.0,Gold,6,96.59863945578232,13,60.0
Qwen3-14B,253.0,4,240.0,65.38,40.0,Gold,6,96.59863945578232,14,60.0
seed-oss_8192,242.0,4,229.0,67.41,40.0,Gold,8,95.23809523809524,15,57.25
gpt-oss-20b-low,227.0,4,227.0,64.32,20.0,Gold,8,95.23809523809524,16,56.75
gpt-oss-120b-low,235.0,4,222.0,68.6,40.0,Gold,8,95.23809523809524,17,55.5
gpt-4.1,214.0,4,201.0,62.5,20.0,Silver,12,92.51700680272108,18,50.25
deepseek-reasoner,194.0,4,181.0,59.55,20.0,Silver,16,89.79591836734694,19,45.25
seed-oss_2048,156.0,4,143.0,46.36,20.0,Silver,32,78.91156462585035,20,35.75
seed-oss_4096,148.0,4,135.0,38.27,20.0,Silver,36,76.19047619047619,21,33.75
Qwen3-8B,142.0,4,129.0,34.13,0.0,Bronze,41,72.78911564625851,22,32.25
OpenCodeReasoning-Nemotron-32B-IOI,141.0,4,128.0,46.38,0.0,Bronze,41,72.78911564625851,28,32.0
Qwen3-32B-Non-Thinking,103.0,4,103.0,23.01,0.0,Bronze,61,59.183673469387756,24,25.75
seed-oss_1024,74.0,4,61.0,14.75,0.0,None,90,39.45578231292517,26,15.25
deepseek-chat,74.0,4,61.0,30.05,0.0,None,90,39.45578231292517,27,15.25
Llama-4-Scout,61.0,4,61.0,14.53,0.0,None,90,39.45578231292517,28,15.25
seed-oss_512,67.0,4,54.0,24.26,0.0,None,97,34.69387755102041,29,13.5
Qwen2.5-Coder-14B-Instruct,51.0,4,51.0,16.35,0.0,None,99,33.333333333333336,30,12.75
Qwen3-4B-Non-Thinking,45.0,4,45.0,12.75,0.0,None,105,29.25170068027211,31,11.25
Qwen3-8B-Non-Thinking,54.0,4,41.0,25.1,0.0,None,109,26.53061224489796,32,10.25
OlympicCoder-7B,53.0,4,40.0,29.68,0.0,None,110,25.85034013605442,37,10.0
Qwen3-30B-Non-Thinking,33.0,4,33.0,14.12,0.0,None,112,24.489795918367346,33,8.25
Codestral-22B-v0.1,31.0,4,31.0,11.57,0.0,None,112,24.489795918367346,35,7.75
DeepSeek-R1-Distill-Llama-70B,44.0,4,31.0,15.08,0.0,None,112,24.489795918367346,34,7.75
Llama-3.3-70B-Instruct,31.0,4,31.0,21.47,0.0,None,112,24.489795918367346,36,7.75
Mistral-Large-Instruct-2411,31.0,4,31.0,12.09,0.0,None,112,24.489795918367346,37,7.75
Qwen3-4B,42.0,4,29.0,14.59,0.0,None,113,23.80952380952381,38,7.25
DeepSeek-Coder-V2-Lite-Instruct,38.0,4,25.0,14.12,0.0,None,115,22.448979591836736,39,6.25
Mistral-Small-3.1-24B-2503,36.0,4,23.0,10.31,0.0,None,116,21.768707482993197,40,5.75
seed-oss_0,19.0,4,19.0,11.69,0.0,None,117,21.08843537414966,41,4.75
Qwen2.5-72B,19.0,4,19.0,10.72,0.0,None,117,21.08843537414966,42,4.75
Qwen2.5-Coder-7B-Instruct,6.0,4,6.0,3.88,0.0,None,127,14.285714285714286,43,1.5
DeepSeek-R1-Distill-Llama-8B,13.0,4,0.0,5.0,0.0,None,134,9.523809523809524,45,0.0
QwQ-32B,278.0,4,265.0,72.63,60.0,Gold,3,98.63945578231292,10,66.25
DeepSeek-R1-Distill-Qwen-32B,83.0,4,70.0,29.65,0.0,None,78,47.61904761904762,25,17.5
DeepSeek-R1-Distill-Qwen-14B,13.0,4,0.0,5.0,0.0,None,134,9.523809523809524,44,0.0
Qwen2.5-Coder-32B-Instruct,122.0,4,122.0,31.36,0.0,Bronze,45,70.06802721088435,23,30.5
DeepSeek-R1-Distill-Qwen-7B,0.0,4,0.0,0.0,0.0,None,134,9.523809523809524,46,0.0
Qwen3-14B-Non-Thinking,0.0,4,0.0,4.99,0.0,None,134,9.523809523809524,47,0.0
Llama-3.1-8B-Instruct,0.0,4,0.0,2.34,0.0,None,134,9.523809523809524,48,0.0
