Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-120b-high,79.0,6,79.0,53.78,33.33,Gold,4,83.33333333333333,1,52.67
gpt-5,62.0,6,62.0,47.24,33.33,Silver,9,55.55555555555556,2,41.33
gpt-oss-120b-medium,60.0,6,60.0,45.74,33.33,Silver,11,44.44444444444444,3,40.0
gpt-oss-120b_sp4,53.0,6,53.0,40.93,33.33,Bronze,14,27.77777777777778,4,35.33
gpt-oss-120b_sp3,53.0,6,53.0,41.19,33.33,Bronze,14,27.77777777777778,5,35.33
grok-4-fast-reasoning,40.0,6,40.0,34.11,16.67,Bronze,16,16.666666666666668,6,26.67
gpt-o3-mini-high,38.0,6,38.0,34.95,16.67,Bronze,16,16.666666666666668,4,25.33
Qwen3-32B,36.0,6,36.0,31.09,16.67,Bronze,17,11.11111111111111,5,24.0
Qwen3-14B,36.0,6,36.0,28.75,16.67,Bronze,17,11.11111111111111,6,24.0
Qwen3-8B,34.0,6,34.0,26.66,16.67,Bronze,17,11.11111111111111,7,22.67
gpt-oss-120b_sp1,33.0,6,33.0,36.15,16.67,Bronze,17,11.11111111111111,11,22.0
gpt-oss-120b_sp2,31.0,6,31.0,34.59,16.67,Bronze,17,11.11111111111111,12,20.67
seed-oss_16384,31.0,6,31.0,29.16,16.67,Bronze,17,11.11111111111111,8,20.67
deepseek-reasoner,31.0,6,31.0,27.76,16.67,Bronze,17,11.11111111111111,9,20.67
OpenCodeReasoning-Nemotron-32B-IOI,31.0,6,31.0,26.01,16.67,Bronze,17,11.11111111111111,15,20.67
QwQ-32B,31.0,6,31.0,26.83,0.0,Bronze,17,11.11111111111111,10,20.67
gpt-oss-20b-high,30.0,6,30.0,84.91,50.0,Bronze,17,11.11111111111111,11,20.0
gpt-oss-20b-low,28.0,6,28.0,37.87,20.0,Bronze,18,5.555555555555555,12,18.67
seed-oss_-1,28.0,6,28.0,31.19,16.67,Bronze,18,5.555555555555555,13,18.67
gpt-oss-20b-medium,28.0,6,28.0,29.43,16.67,Bronze,18,5.555555555555555,14,18.67
seed-oss_8192,28.0,6,28.0,30.07,16.67,Bronze,18,5.555555555555555,15,18.67
gemini-2.5-flash,28.0,6,28.0,30.81,16.67,Bronze,18,5.555555555555555,16,18.67
gpt-oss-120b-low,26.0,6,26.0,31.89,16.67,Bronze,18,5.555555555555555,17,17.33
Qwen3-30B,26.0,6,26.0,23.9,16.67,Bronze,18,5.555555555555555,18,17.33
gemini-2.5-pro,26.0,6,26.0,32.8,16.67,Bronze,18,5.555555555555555,19,17.33
claude-sonnet-4.5,26.0,6,26.0,28.92,16.67,Bronze,18,5.555555555555555,26,17.33
Qwen3-4B,19.0,6,19.0,28.79,0.0,None,19,0.0,20,12.67
DeepSeek-R1-Distill-Qwen-32B,14.0,6,14.0,13.57,0.0,None,19,0.0,21,9.33
gpt-4.1,13.0,6,13.0,16.08,0.0,None,19,0.0,22,8.67
seed-oss_2048,9.0,6,9.0,9.4,0.0,None,19,0.0,23,6.0
Qwen3-8B-Non-Thinking,9.0,6,9.0,9.16,0.0,None,19,0.0,24,6.0
Mistral-Large-Instruct-2411,6.0,6,6.0,8.36,0.0,None,19,0.0,25,4.0
DeepSeek-R1-Distill-Llama-70B,6.0,6,6.0,10.72,0.0,None,19,0.0,26,4.0
OlympicCoder-7B,5.0,6,5.0,11.94,0.0,None,19,0.0,34,3.33
Qwen2.5-Coder-32B-Instruct,3.0,6,3.0,6.94,0.0,None,19,0.0,27,2.0
seed-oss_0,1.0,6,1.0,5.11,0.0,None,19,0.0,28,0.67
Codestral-22B-v0.1,1.0,6,1.0,5.75,0.0,None,19,0.0,29,0.67
Mistral-Small-3.1-24B-2503,1.0,6,1.0,2.48,0.0,None,19,0.0,30,0.67
Llama-3.1-8B-Instruct,1.0,6,1.0,4.44,0.0,None,19,0.0,31,0.67
seed-oss_512,1.0,6,1.0,7.3,0.0,None,19,0.0,32,0.67
seed-oss_1024,1.0,6,1.0,8.57,0.0,None,19,0.0,33,0.67
seed-oss_4096,1.0,6,1.0,15.83,0.0,None,19,0.0,34,0.67
deepseek-chat,1.0,6,1.0,13.18,0.0,None,19,0.0,35,0.67
Qwen3-30B-Non-Thinking,1.0,6,1.0,8.13,0.0,None,19,0.0,36,0.67
Qwen2.5-72B,1.0,6,1.0,8.32,0.0,None,19,0.0,37,0.67
Qwen2.5-Coder-7B-Instruct,1.0,6,1.0,3.97,0.0,None,19,0.0,38,0.67
DeepSeek-Coder-V2-Lite-Instruct,1.0,6,1.0,4.99,0.0,None,19,0.0,39,0.67
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,0.0,0.0,None,19,0.0,40,0.0
Qwen3-14B-Non-Thinking,0.0,6,0.0,0.0,0.0,None,19,0.0,41,0.0
Llama-4-Scout,0.0,6,0.0,6.61,0.0,None,19,0.0,42,0.0
Qwen3-4B-Non-Thinking,0.0,6,0.0,2.18,0.0,None,19,0.0,43,0.0
Qwen3-32B-Non-Thinking,0.0,6,0.0,8.64,0.0,None,19,0.0,44,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,6,0.0,1.22,0.0,None,19,0.0,46,0.0
Llama-3.3-70B-Instruct,0.0,6,0.0,4.79,0.0,None,19,0.0,45,0.0
Qwen2.5-Coder-14B-Instruct,0.0,6,0.0,6.08,0.0,None,19,0.0,47,0.0
