Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-120b_sp1,224.0,6,224.0,48.51,16.67,Bronze,30,59.72222222222222,1,37.33
grok-4-fast-reasoning,220.0,6,220.0,74.8,16.67,Bronze,30,59.72222222222222,2,36.67
gpt-oss-20b-high,0.0,6,0.0,0.0,0.0,None,73,0.0,34,0.0
gpt-5,187.0,6,187.0,63.83,16.67,None,37,50.0,2,31.17
gpt-oss-120b_sp2,181.0,6,181.0,48.2,16.67,None,37,50.0,5,30.17
gemini-2.5-pro,181.0,6,181.0,74.46,16.67,None,37,50.0,3,30.17
seed-oss_16384,169.0,6,169.0,61.85,16.67,None,37,50.0,4,28.17
seed-oss_-1,193.0,6,193.0,71.87,16.67,None,35,52.77777777777778,1,32.17
gpt-oss-120b_sp3,147.0,6,147.0,50.91,16.67,None,46,37.5,9,24.5
gpt-oss-120b-high,136.0,6,136.0,56.5,16.67,None,47,36.111111111111114,5,22.67
gpt-oss-120b_sp4,134.0,6,134.0,47.0,16.67,None,48,34.72222222222222,11,22.33
seed-oss_8192,132.0,6,132.0,38.14,16.67,None,48,34.72222222222222,6,22.0
gpt-oss-120b-medium,128.0,6,128.0,52.26,16.67,None,49,33.333333333333336,7,21.33
gpt-oss-20b-medium,119.0,6,119.0,45.5,16.67,None,51,30.555555555555557,8,19.83
claude-sonnet-4.5,110.0,6,110.0,63.23,0.0,None,55,25.0,15,18.33
gpt-o3-mini-high,92.0,6,92.0,37.45,0.0,None,56,23.61111111111111,9,15.33
gemini-2.5-flash,92.0,6,92.0,51.12,0.0,None,56,23.61111111111111,10,15.33
deepseek-reasoner,60.0,6,60.0,49.95,0.0,None,61,16.666666666666668,11,10.0
gpt-oss-120b-low,47.0,6,47.0,36.59,0.0,None,64,12.5,12,7.83
gpt-4.1,47.0,6,47.0,28.65,0.0,None,64,12.5,13,7.83
Qwen3-30B,37.0,6,37.0,43.0,0.0,None,66,9.722222222222221,15,6.17
gpt-oss-20b-low,28.0,6,28.0,25.12,0.0,None,67,8.333333333333334,17,4.67
OlympicCoder-32B,26.0,6,26.0,19.15,0.0,None,68,6.944444444444445,23,4.33
Qwen3-14B,24.0,6,24.0,47.7,0.0,None,69,5.555555555555555,18,4.0
Llama-4-Scout,20.0,6,20.0,18.33,0.0,None,70,4.166666666666667,19,3.33
Qwen2.5-72B,20.0,6,20.0,22.81,0.0,None,70,4.166666666666667,20,3.33
DeepSeek-R1-Distill-Llama-70B,19.0,6,19.0,22.97,0.0,None,70,4.166666666666667,21,3.17
OpenCodeReasoning-Nemotron-32B-IOI,19.0,6,19.0,29.17,0.0,None,70,4.166666666666667,28,3.17
Qwen2.5-Coder-14B-Instruct,17.0,6,17.0,13.78,0.0,None,72,1.3888888888888888,22,2.83
Qwen3-32B,40.0,6,40.0,42.77,0.0,None,66,9.722222222222221,14,6.67
deepseek-chat,13.0,6,13.0,21.34,0.0,None,72,1.3888888888888888,23,2.17
Qwen3-8B-Non-Thinking,11.0,6,11.0,11.8,0.0,None,72,1.3888888888888888,25,1.83
seed-oss_4096,11.0,6,11.0,17.47,0.0,None,72,1.3888888888888888,27,1.83
Qwen3-30B-Non-Thinking,11.0,6,11.0,15.46,0.0,None,72,1.3888888888888888,26,1.83
Mistral-Large-Instruct-2411,9.0,6,9.0,10.97,0.0,None,73,0.0,28,1.5
DeepSeek-Coder-V2-Lite-Instruct,9.0,6,9.0,12.86,0.0,None,73,0.0,29,1.5
Qwen3-8B,9.0,6,9.0,27.01,0.0,None,73,0.0,30,1.5
Codestral-22B-v0.1,8.0,6,8.0,7.75,0.0,None,73,0.0,31,1.33
Qwen3-4B,8.0,6,8.0,9.28,0.0,None,73,0.0,33,1.33
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,73,0.0,35,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,0.0,0.0,None,73,0.0,36,0.0
Qwen2.5-Coder-7B-Instruct,0.0,6,0.0,2.45,0.0,None,73,0.0,38,0.0
Qwen3-14B-Non-Thinking,0.0,6,0.0,0.0,0.0,None,73,0.0,37,0.0
seed-oss_1024,0.0,6,0.0,3.87,0.0,None,73,0.0,40,0.0
Llama-3.3-70B-Instruct,0.0,6,0.0,5.61,0.0,None,73,0.0,42,0.0
seed-oss_2048,0.0,6,0.0,3.55,0.0,None,73,0.0,43,0.0
seed-oss_512,0.0,6,0.0,8.74,0.0,None,73,0.0,44,0.0
OlympicCoder-7B,0.0,6,0.0,5.68,0.0,None,73,0.0,48,0.0
Llama-3.1-8B-Instruct,0.0,6,0.0,2.76,0.0,None,73,0.0,39,0.0
Qwen3-4B-Non-Thinking,0.0,6,0.0,4.32,0.0,None,73,0.0,41,0.0
DeepSeek-R1-Distill-Qwen-32B,11.0,6,11.0,11.59,0.0,None,72,1.3888888888888888,24,1.83
DeepSeek-R1-Distill-Qwen-14B,8.0,6,8.0,9.7,0.0,None,73,0.0,32,1.33
Mistral-Small-3.1-24B-2503,0.0,6,0.0,4.27,0.0,None,73,0.0,45,0.0
Qwen2.5-Coder-32B-Instruct,0.0,6,0.0,7.96,0.0,None,73,0.0,46,0.0
QwQ-32B,37.0,6,37.0,36.54,0.0,None,66,9.722222222222221,16,6.17
Qwen3-32B-Non-Thinking,0.0,6,0.0,10.24,0.0,None,73,0.0,47,0.0
seed-oss_0,0.0,6,0.0,6.2,0.0,None,73,0.0,48,0.0
