Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-o3-mini-high,293.0,6,293.0,76.14,33.33,Bronze,24,58.92857142857143,1,48.83
gpt-5,251.0,6,251.0,71.38,33.33,None,30,48.214285714285715,2,41.83
gpt-oss-120b-high,224.0,6,224.0,66.65,33.33,None,34,41.07142857142857,3,37.33
gemini-2.5-flash,219.0,6,219.0,67.22,16.67,None,35,39.285714285714285,4,36.5
gemini-2.5-pro,194.0,6,194.0,61.28,0.0,None,37,35.714285714285715,5,32.33
claude-sonnet-4.5,174.0,6,174.0,70.93,0.0,None,40,30.357142857142858,6,29.0
grok-4-fast-reasoning,170.0,6,170.0,57.16,16.67,None,40,30.357142857142858,7,28.33
gpt-oss-120b_sp2,168.0,6,168.0,58.25,16.67,None,41,28.571428571428573,8,28.0
gpt-oss-120b_sp1,160.0,6,160.0,51.33,16.67,None,43,25.0,9,26.67
gpt-oss-120b_sp4,160.0,6,160.0,56.2,16.67,None,43,25.0,10,26.67
gpt-oss-20b-high,160.0,6,160.0,57.56,16.67,None,43,25.0,11,26.67
gpt-oss-120b-low,155.0,6,155.0,48.2,16.67,None,43,25.0,12,25.83
gpt-oss-120b-medium,155.0,6,155.0,51.18,16.67,None,43,25.0,13,25.83
Qwen3-32B-Non-Thinking,124.0,6,124.0,40.43,16.67,None,48,16.071428571428573,14,20.67
Qwen2.5-Coder-32B-Instruct,110.0,6,110.0,33.22,16.67,None,51,10.714285714285714,15,18.33
gpt-4.1,100.0,6,100.0,50.77,0.0,None,52,8.928571428571429,16,16.67
seed-oss_16384,74.0,6,74.0,43.88,0.0,None,55,3.5714285714285716,17,12.33
gpt-oss-20b-medium,60.0,6,60.0,43.27,0.0,None,55,3.5714285714285716,18,10.0
gpt-oss-120b_sp3,60.0,6,60.0,49.59,0.0,None,55,3.5714285714285716,19,10.0
seed-oss_2048,59.0,6,59.0,39.97,0.0,None,55,3.5714285714285716,20,9.83
seed-oss_8192,59.0,6,59.0,39.33,0.0,None,55,3.5714285714285716,21,9.83
seed-oss_-1,48.0,6,48.0,50.2,0.0,None,57,0.0,22,8.0
gpt-oss-20b-low,46.0,6,46.0,31.51,0.0,None,57,0.0,23,7.67
deepseek-chat,33.0,6,33.0,35.34,0.0,None,57,0.0,24,5.5
Qwen3-32B,30.0,6,30.0,41.65,0.0,None,57,0.0,25,5.0
deepseek-reasoner,30.0,6,30.0,42.07,0.0,None,57,0.0,26,5.0
Qwen3-14B-Non-Thinking,30.0,6,30.0,28.92,0.0,None,57,0.0,27,5.0
Qwen3-30B,26.0,6,26.0,28.93,0.0,None,57,0.0,28,4.33
Qwen3-30B-Non-Thinking,24.0,6,24.0,20.6,0.0,None,57,0.0,29,4.0
Qwen3-8B,22.0,6,22.0,36.29,0.0,None,57,0.0,30,3.67
Llama-3.3-70B-Instruct,20.0,6,20.0,14.73,0.0,None,57,0.0,31,3.33
Qwen3-8B-Non-Thinking,20.0,6,20.0,18.15,0.0,None,57,0.0,32,3.33
OpenCodeReasoning-Nemotron-32B-IOI,19.0,6,19.0,31.95,0.0,None,57,0.0,33,3.17
Qwen3-4B-Non-Thinking,17.0,6,17.0,11.48,0.0,None,57,0.0,34,2.83
OlympicCoder-7B,16.0,6,16.0,20.78,0.0,None,57,0.0,35,2.67
seed-oss_1024,16.0,6,16.0,27.1,0.0,None,57,0.0,36,2.67
Qwen3-4B,16.0,6,16.0,32.88,0.0,None,57,0.0,37,2.67
Codestral-22B-v0.1,16.0,6,16.0,20.27,0.0,None,57,0.0,38,2.67
Qwen2.5-Coder-14B-Instruct,16.0,6,16.0,20.93,0.0,None,57,0.0,39,2.67
QwQ-32B,15.0,6,15.0,29.26,0.0,None,57,0.0,40,2.5
Mistral-Large-Instruct-2411,13.0,6,13.0,19.75,0.0,None,57,0.0,41,2.17
Qwen2.5-72B,10.0,6,10.0,18.37,0.0,None,57,0.0,42,1.67
Mistral-Small-3.1-24B-2503,10.0,6,10.0,17.47,0.0,None,57,0.0,43,1.67
Qwen3-14B,6.0,6,6.0,20.74,0.0,None,57,0.0,44,1.0
seed-oss_4096,6.0,6,6.0,13.87,0.0,None,57,0.0,45,1.0
DeepSeek-R1-Distill-Llama-70B,6.0,6,6.0,23.23,0.0,None,57,0.0,46,1.0
DeepSeek-R1-Distill-Qwen-32B,6.0,6,6.0,20.85,0.0,None,57,0.0,47,1.0
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,57,0.0,48,0.0
DeepSeek-R1-Distill-Qwen-14B,0.0,6,0.0,5.54,0.0,None,57,0.0,49,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,1.61,0.0,None,57,0.0,50,0.0
Qwen2.5-Coder-7B-Instruct,0.0,6,0.0,2.37,0.0,None,57,0.0,51,0.0
Llama-3.1-8B-Instruct,0.0,6,0.0,6.64,0.0,None,57,0.0,52,0.0
Llama-4-Scout,0.0,6,0.0,5.72,0.0,None,57,0.0,53,0.0
seed-oss_512,0.0,6,0.0,10.29,0.0,None,57,0.0,54,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,6,0.0,6.53,0.0,None,57,0.0,55,0.0
seed-oss_0,0.0,6,0.0,15.85,0.0,None,57,0.0,56,0.0
