Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-120b-high,656.0,8,656.0,88.01,75.0,Gold,4,99.3006993006993,1,82.0
gpt-5,540.0,8,540.0,82.34,50.0,Gold,27,93.93939393939394,2,67.5
gpt-oss-120b_sp1,524.0,8,524.0,68.12,62.5,Gold,33,92.54079254079254,3,65.5
gpt-oss-120b-medium,475.0,8,475.0,69.93,50.0,Silver,60,86.24708624708624,3,59.38
gpt-oss-20b-medium,464.0,8,464.0,63.57,50.0,Silver,72,83.44988344988344,4,58.0
gemini-2.5-pro,447.0,8,447.0,72.49,12.5,Bronze,94,78.32167832167832,5,55.88
gpt-oss-20b-high,443.0,8,443.0,63.61,57.14,Bronze,99,77.15617715617715,6,55.38
gpt-oss-120b_sp2,428.0,8,428.0,57.48,50.0,Bronze,113,73.89277389277389,8,53.5
gpt-o3-mini-high,423.0,8,423.0,65.44,50.0,Bronze,121,72.02797202797203,7,52.88
gpt-oss-20b-low,404.0,8,404.0,53.89,50.0,None,140,67.5990675990676,8,50.5
grok-4-fast-reasoning,398.0,8,398.0,60.77,37.5,None,144,66.66666666666667,11,49.75
seed-oss_-1,397.0,8,397.0,58.28,37.5,None,144,66.66666666666667,9,49.62
QwQ-32B,392.0,8,392.0,51.25,37.5,None,152,64.8018648018648,10,49.0
gpt-oss-120b_sp3,386.0,8,386.0,57.68,37.5,None,162,62.47086247086247,14,48.25
gpt-oss-120b-low,380.0,8,380.0,59.02,37.5,None,170,60.60606060606061,11,47.5
seed-oss_16384,379.0,8,379.0,53.98,37.5,None,170,60.60606060606061,12,47.38
deepseek-reasoner,369.0,8,369.0,50.87,37.5,None,180,58.27505827505828,13,46.12
Qwen3-30B,358.0,8,358.0,53.1,37.5,None,190,55.94405594405595,14,44.75
Qwen3-14B,354.0,8,354.0,48.24,37.5,None,192,55.47785547785548,15,44.25
OpenCodeReasoning-Nemotron-32B-IOI,354.0,8,354.0,45.64,37.5,None,192,55.47785547785548,20,44.25
gpt-oss-120b_sp4,353.0,8,353.0,53.26,25.0,None,196,54.54545454545455,21,44.12
DeepSeek-R1-Distill-Llama-70B,343.0,8,343.0,51.52,37.5,None,211,51.04895104895105,16,42.88
Qwen3-8B,342.0,8,342.0,53.57,37.5,None,214,50.34965034965035,17,42.75
Qwen3-32B,320.0,8,320.0,42.95,25.0,None,236,45.22144522144522,18,40.0
gemini-2.5-flash,289.0,8,289.0,45.56,25.0,None,270,37.2960372960373,19,36.12
seed-oss_8192,263.0,8,263.0,38.35,12.5,None,301,30.06993006993007,20,32.88
Llama-3.3-70B-Instruct,256.0,8,256.0,47.24,25.0,None,308,28.43822843822844,21,32.0
OlympicCoder-7B,252.0,8,252.0,41.57,25.0,None,313,27.272727272727273,28,31.5
Qwen3-4B,242.0,8,242.0,41.74,25.0,None,319,25.874125874125873,22,30.25
deepseek-chat,238.0,8,238.0,33.32,25.0,None,323,24.941724941724942,23,29.75
gpt-4.1,228.0,8,228.0,33.24,25.0,None,329,23.543123543123542,24,28.5
Mistral-Large-Instruct-2411,227.0,8,227.0,37.47,25.0,None,330,23.31002331002331,25,28.38
Qwen2.5-Coder-32B-Instruct,218.0,8,218.0,36.85,12.5,None,334,22.377622377622377,26,27.25
seed-oss_1024,216.0,8,216.0,29.26,25.0,None,335,22.144522144522146,27,27.0
Qwen2.5-72B,211.0,8,211.0,35.02,12.5,None,337,21.678321678321677,28,26.38
Qwen2.5-Coder-14B-Instruct,208.0,8,208.0,34.76,25.0,None,337,21.678321678321677,29,26.0
Qwen3-32B-Non-Thinking,204.0,8,204.0,31.28,25.0,None,337,21.678321678321677,30,25.5
DeepSeek-R1-Distill-Qwen-14B,204.0,8,204.0,30.37,12.5,None,337,21.678321678321677,31,25.5
seed-oss_4096,202.0,8,202.0,30.28,12.5,None,341,20.745920745920746,32,25.25
DeepSeek-R1-Distill-Qwen-32B,186.0,8,186.0,33.8,0.0,None,355,17.482517482517483,33,23.25
Codestral-22B-v0.1,182.0,8,182.0,23.29,12.5,None,359,16.55011655011655,34,22.75
seed-oss_2048,176.0,8,176.0,24.47,12.5,None,365,15.151515151515152,35,22.0
claude-sonnet-4.5,172.0,8,172.0,48.18,0.0,None,367,14.685314685314685,43,21.5
Llama-3.1-8B-Instruct,170.0,8,170.0,22.65,0.0,None,371,13.752913752913752,36,21.25
Mistral-Small-3.1-24B-2503,166.0,8,166.0,22.98,12.5,None,372,13.51981351981352,37,20.75
DeepSeek-Coder-V2-Lite-Instruct,163.0,8,163.0,27.5,0.0,None,375,12.820512820512821,38,20.38
Llama-4-Scout,123.0,8,123.0,20.72,0.0,None,402,6.526806526806527,39,15.38
seed-oss_0,117.0,8,117.0,24.97,12.5,None,407,5.361305361305361,40,14.62
Qwen3-14B-Non-Thinking,104.0,8,104.0,20.81,12.5,None,410,4.662004662004662,42,13.0
Qwen3-4B-Non-Thinking,104.0,8,104.0,21.42,12.5,None,410,4.662004662004662,41,13.0
Qwen2.5-Coder-7B-Instruct,100.0,8,100.0,16.3,12.5,None,414,3.7296037296037294,43,12.5
Qwen3-30B-Non-Thinking,100.0,8,100.0,16.69,12.5,None,414,3.7296037296037294,44,12.5
seed-oss_512,36.0,8,36.0,6.25,0.0,None,424,1.3986013986013985,45,4.5
Qwen3-8B-Non-Thinking,15.0,8,15.0,4.54,0.0,None,426,0.9324009324009324,46,1.88
DeepSeek-R1-Distill-Llama-8B,0.0,8,0.0,0.0,0.0,None,429,0.2331002331002331,47,0.0
