Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gemini-2.5-pro,297.0,5,297.0,78.7,40.0,Silver,49,83.27526132404181,1,59.4
gpt-5,273.0,5,273.0,79.97,40.0,Silver,60,79.44250871080139,2,54.6
gpt-oss-20b-high,257.0,5,257.0,67.62,40.0,Bronze,76,73.86759581881533,3,51.4
gemini-2.5-flash,237.0,5,237.0,56.0,20.0,Bronze,81,72.12543554006969,4,47.4
seed-oss_16384,223.0,5,223.0,71.16,40.0,Bronze,90,68.98954703832753,6,44.6
gpt-oss-120b_sp4,223.0,5,223.0,64.11,40.0,Bronze,90,68.98954703832753,6,44.6
gpt-o3-mini-high,223.0,5,223.0,64.11,40.0,Bronze,90,68.98954703832753,5,44.6
gpt-oss-120b_sp2,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,8,41.4
gpt-oss-120b_sp3,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,9,41.4
gpt-oss-120b-medium,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,9,41.4
QwQ-32B,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,13,41.4
gpt-oss-120b-low,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,7,41.4
seed-oss_-1,207.0,5,207.0,58.99,40.0,Bronze,103,64.45993031358886,8,41.4
Qwen3-32B,207.0,5,207.0,53.48,40.0,Bronze,103,64.45993031358886,12,41.4
gpt-oss-120b-high,207.0,5,207.0,60.71,40.0,Bronze,103,64.45993031358886,11,41.4
gpt-oss-20b-medium,207.0,5,207.0,51.76,40.0,Bronze,103,64.45993031358886,10,41.4
gpt-oss-120b_sp1,207.0,5,207.0,53.86,40.0,Bronze,103,64.45993031358886,17,41.4
grok-4-fast-reasoning,188.0,5,188.0,77.31,25.0,None,164,43.20557491289198,18,37.6
deepseek-chat,150.0,5,150.0,40.49,20.0,None,184,36.23693379790941,14,30.0
DeepSeek-R1-Distill-Llama-70B,115.0,5,115.0,38.06,20.0,None,196,32.055749128919864,15,23.0
seed-oss_8192,107.0,5,107.0,32.55,20.0,None,197,31.70731707317073,17,21.4
DeepSeek-R1-Distill-Qwen-32B,107.0,5,107.0,31.76,20.0,None,197,31.70731707317073,16,21.4
deepseek-reasoner,107.0,5,107.0,32.55,20.0,None,197,31.70731707317073,18,21.4
OpenCodeReasoning-Nemotron-32B-IOI,107.0,5,107.0,32.55,20.0,None,197,31.70731707317073,24,21.4
seed-oss_0,100.0,5,100.0,20.0,20.0,None,204,29.26829268292683,19,20.0
gpt-oss-20b-low,100.0,5,100.0,25.7,20.0,None,204,29.26829268292683,20,20.0
claude-sonnet-4.5,86.0,5,86.0,45.95,0.0,None,222,22.99651567944251,27,17.2
gpt-4.1,67.0,5,67.0,46.98,0.0,None,242,16.027874564459932,21,13.4
Qwen2.5-Coder-32B-Instruct,53.0,5,53.0,18.44,0.0,None,245,14.982578397212544,22,10.6
DeepSeek-Coder-V2-Lite-Instruct,24.0,5,24.0,7.52,0.0,None,276,4.181184668989547,23,4.8
Qwen3-14B,15.0,5,15.0,18.85,0.0,None,277,3.832752613240418,25,3.0
Qwen3-4B,15.0,5,15.0,18.85,0.0,None,277,3.832752613240418,24,3.0
Qwen3-14B-Non-Thinking,0.0,5,0.0,0.0,0.0,None,288,0.0,40,0.0
Qwen3-30B,15.0,5,15.0,21.73,0.0,None,277,3.832752613240418,26,3.0
Qwen2.5-Coder-14B-Instruct,12.0,5,12.0,16.71,0.0,None,277,3.832752613240418,27,2.4
Codestral-22B-v0.1,8.0,5,8.0,0.42,0.0,None,278,3.484320557491289,29,1.6
Qwen3-32B-Non-Thinking,0.0,5,0.0,0.0,0.0,None,288,0.0,41,0.0
seed-oss_1024,8.0,5,8.0,4.79,0.0,None,278,3.484320557491289,32,1.6
Qwen2.5-72B,8.0,5,8.0,6.11,0.0,None,278,3.484320557491289,28,1.6
Qwen3-30B-Non-Thinking,8.0,5,8.0,3.78,0.0,None,278,3.484320557491289,30,1.6
Qwen3-4B-Non-Thinking,8.0,5,8.0,1.2,0.0,None,278,3.484320557491289,31,1.6
Llama-3.3-70B-Instruct,8.0,5,8.0,4.59,0.0,None,278,3.484320557491289,33,1.6
Qwen3-8B,8.0,5,8.0,26.0,0.0,None,278,3.484320557491289,34,1.6
Mistral-Large-Instruct-2411,8.0,5,8.0,6.14,0.0,None,278,3.484320557491289,37,1.6
Mistral-Small-3.1-24B-2503,8.0,5,8.0,6.72,0.0,None,278,3.484320557491289,36,1.6
Llama-4-Scout,8.0,5,8.0,6.93,0.0,None,278,3.484320557491289,35,1.6
seed-oss_4096,7.0,5,7.0,12.55,0.0,None,278,3.484320557491289,38,1.4
Qwen3-8B-Non-Thinking,4.0,5,4.0,12.45,0.0,None,288,0.0,39,0.8
DeepSeek-R1-Distill-Qwen-7B,0.0,5,0.0,0.0,0.0,None,288,0.0,42,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,5,0.0,0.0,0.0,None,288,0.0,43,0.0
Qwen2.5-Coder-7B-Instruct,0.0,5,0.0,0.0,0.0,None,288,0.0,44,0.0
Llama-3.1-8B-Instruct,0.0,5,0.0,0.0,0.0,None,288,0.0,45,0.0
seed-oss_512,0.0,5,0.0,0.0,0.0,None,288,0.0,46,0.0
OlympicCoder-7B,0.0,5,0.0,0.0,0.0,None,288,0.0,54,0.0
seed-oss_2048,0.0,5,0.0,0.78,0.0,None,288,0.0,47,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,5,0.0,8.37,0.0,None,288,0.0,48,0.0
