Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-oss-120b_sp3,450.0,5,450.0,100.0,100.0,Gold,1,100.0,1,90.0
gpt-oss-120b_sp4,450.0,5,450.0,100.0,100.0,Gold,1,100.0,2,90.0
seed-oss_16384,450.0,5,450.0,100.0,100.0,Gold,1,100.0,5,90.0
gpt-oss-20b-high,450.0,5,450.0,100.0,100.0,Gold,1,100.0,1,90.0
gpt-5,450.0,5,450.0,100.0,100.0,Gold,1,100.0,3,90.0
gpt-oss-120b-high,450.0,5,450.0,100.0,100.0,Gold,1,100.0,2,90.0
gpt-oss-120b-medium,450.0,5,450.0,100.0,100.0,Gold,1,100.0,4,90.0
grok-4-fast-reasoning,450.0,5,450.0,100.0,100.0,Gold,1,100.0,8,90.0
gpt-oss-120b_sp2,450.0,5,450.0,100.0,100.0,Gold,1,100.0,9,90.0
gpt-oss-120b_sp1,450.0,5,450.0,100.0,100.0,Gold,1,100.0,10,90.0
seed-oss_-1,450.0,5,450.0,100.0,100.0,Gold,1,100.0,6,90.0
gpt-oss-120b-low,419.0,5,419.0,95.5,80.0,Gold,7,97.02970297029702,8,83.8
gemini-2.5-pro,419.0,5,419.0,95.5,80.0,Gold,7,97.02970297029702,7,83.8
gpt-o3-mini-high,380.0,5,380.0,80.0,80.0,Gold,9,96.03960396039604,9,76.0
seed-oss_8192,361.0,5,361.0,89.5,80.0,Gold,9,96.03960396039604,10,72.2
gemini-2.5-flash,361.0,5,361.0,93.0,80.0,Gold,9,96.03960396039604,11,72.2
gpt-oss-20b-medium,361.0,5,361.0,93.0,80.0,Gold,9,96.03960396039604,12,72.2
QwQ-32B,348.0,5,348.0,74.9,60.0,Gold,9,96.03960396039604,13,69.6
gpt-oss-20b-low,340.0,5,340.0,82.0,80.0,Gold,9,96.03960396039604,14,68.0
Qwen3-32B,315.0,5,315.0,73.0,60.0,Gold,13,94.05940594059406,17,63.0
Qwen3-14B,315.0,5,315.0,73.0,60.0,Gold,13,94.05940594059406,15,63.0
deepseek-reasoner,315.0,5,315.0,72.5,60.0,Gold,13,94.05940594059406,16,63.0
claude-sonnet-4.5,311.0,5,311.0,69.8,60.0,Gold,13,94.05940594059406,23,62.2
Qwen3-30B,290.0,5,290.0,65.7,60.0,Gold,14,93.56435643564356,18,58.0
OpenCodeReasoning-Nemotron-32B-IOI,205.0,5,205.0,52.0,40.0,Silver,28,86.63366336633663,25,41.0
gpt-4.1,205.0,5,205.0,52.5,40.0,Silver,28,86.63366336633663,19,41.0
DeepSeek-R1-Distill-Llama-70B,160.0,5,160.0,41.0,40.0,Bronze,54,73.76237623762377,20,32.0
Qwen3-8B,133.0,5,133.0,38.23,20.0,Bronze,65,68.31683168316832,21,26.6
Qwen2.5-72B,128.0,5,128.0,39.66,20.0,Bronze,65,68.31683168316832,22,25.6
Llama-3.3-70B-Instruct,70.0,5,70.0,24.9,20.0,Bronze,85,58.415841584158414,26,14.0
Mistral-Small-3.1-24B-2503,70.0,5,70.0,24.9,20.0,Bronze,85,58.415841584158414,24,14.0
Qwen3-4B-Non-Thinking,70.0,5,70.0,24.4,20.0,Bronze,85,58.415841584158414,23,14.0
Qwen2.5-Coder-32B-Instruct,70.0,5,70.0,37.66,20.0,Bronze,85,58.415841584158414,29,14.0
Qwen3-30B-Non-Thinking,70.0,5,70.0,29.66,20.0,Bronze,85,58.415841584158414,27,14.0
deepseek-chat,70.0,5,70.0,30.46,20.0,Bronze,85,58.415841584158414,25,14.0
Qwen3-4B,70.0,5,70.0,29.66,20.0,Bronze,85,58.415841584158414,28,14.0
Qwen3-32B-Non-Thinking,0.0,5,0.0,0.0,0.0,None,194,4.455445544554456,46,0.0
seed-oss_0,50.0,5,50.0,20.99,20.0,None,152,25.247524752475247,30,10.0
seed-oss_4096,50.0,5,50.0,24.76,20.0,None,152,25.247524752475247,31,10.0
seed-oss_2048,50.0,5,50.0,23.81,20.0,None,152,25.247524752475247,32,10.0
Qwen3-14B-Non-Thinking,50.0,5,50.0,24.63,20.0,None,152,25.247524752475247,35,10.0
seed-oss_1024,50.0,5,50.0,24.76,20.0,None,152,25.247524752475247,33,10.0
DeepSeek-R1-Distill-Qwen-14B,50.0,5,50.0,23.67,20.0,None,152,25.247524752475247,34,10.0
DeepSeek-R1-Distill-Qwen-32B,50.0,5,50.0,25.26,20.0,None,152,25.247524752475247,36,10.0
Qwen3-8B-Non-Thinking,39.0,5,39.0,13.81,0.0,None,186,8.415841584158416,37,7.8
Qwen2.5-Coder-14B-Instruct,39.0,5,39.0,14.31,0.0,None,186,8.415841584158416,38,7.8
Mistral-Large-Instruct-2411,39.0,5,39.0,14.81,0.0,None,186,8.415841584158416,39,7.8
DeepSeek-Coder-V2-Lite-Instruct,39.0,5,39.0,13.81,0.0,None,186,8.415841584158416,40,7.8
Llama-4-Scout,26.0,5,26.0,18.6,0.0,None,187,7.920792079207921,41,5.2
Codestral-22B-v0.1,20.0,5,20.0,6.75,0.0,None,187,7.920792079207921,42,4.0
Qwen2.5-Coder-7B-Instruct,20.0,5,20.0,4.4,0.0,None,187,7.920792079207921,44,4.0
Llama-3.1-8B-Instruct,20.0,5,20.0,4.4,0.0,None,187,7.920792079207921,43,4.0
seed-oss_512,19.0,5,19.0,9.91,0.0,None,192,5.445544554455446,45,3.8
DeepSeek-R1-Distill-Llama-8B,0.0,5,0.0,0.0,0.0,None,194,4.455445544554456,47,0.0
OlympicCoder-7B,0.0,5,0.0,4.76,0.0,None,194,4.455445544554456,55,0.0
