Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gemini-2.5-pro,489.0,6,489.0,83.18,66.67,Gold,6,97.38219895287958,1,81.5
grok-4-fast-reasoning,440.0,6,440.0,72.65,66.67,Gold,9,95.81151832460733,2,73.33
claude-sonnet-4.5,369.0,6,369.0,78.54,50.0,Gold,14,93.19371727748691,3,61.5
gpt-5,340.0,6,340.0,55.98,50.0,Gold,13,93.717277486911,2,56.67
gpt-o3-mini-high,340.0,6,340.0,65.09,50.0,Gold,13,93.717277486911,3,56.67
seed-oss_-1,329.0,6,329.0,57.42,33.33,Gold,13,93.717277486911,4,54.83
gemini-2.5-flash,319.0,6,319.0,68.83,33.33,Gold,14,93.19371727748691,5,53.17
gpt-oss-20b-medium,311.0,6,311.0,59.05,50.0,Gold,14,93.19371727748691,6,51.83
gpt-oss-120b_sp3,300.0,6,300.0,60.0,60.0,Silver,24,87.95811518324608,9,50.0
gpt-oss-20b-high,300.0,6,300.0,55.9,50.0,Silver,16,92.14659685863874,7,50.0
gpt-oss-20b-low,300.0,6,300.0,58.64,50.0,Silver,16,92.14659685863874,8,50.0
gpt-oss-120b_sp1,300.0,6,300.0,50.0,50.0,Silver,24,87.95811518324608,12,50.0
gpt-oss-120b-medium,289.0,6,289.0,49.37,33.33,Silver,17,91.62303664921465,9,48.17
Qwen3-14B,269.0,6,269.0,52.42,33.33,Silver,20,90.05235602094241,10,44.83
gpt-4.1,251.0,6,251.0,50.69,33.33,Silver,20,90.05235602094241,11,41.83
Qwen3-32B,251.0,6,251.0,48.84,33.33,Silver,20,90.05235602094241,12,41.83
gpt-oss-120b_sp4,240.0,6,240.0,57.54,40.0,Silver,35,82.19895287958116,17,40.0
seed-oss_16384,240.0,6,240.0,41.37,33.33,Silver,20,90.05235602094241,13,40.0
seed-oss_8192,240.0,6,240.0,46.34,33.33,Silver,20,90.05235602094241,14,40.0
OpenCodeReasoning-Nemotron-32B-IOI,240.0,6,240.0,40.77,33.33,Silver,35,82.19895287958116,20,40.0
OlympicCoder-7B,240.0,6,240.0,46.44,33.33,Silver,35,82.19895287958116,21,40.0
deepseek-reasoner,240.0,6,240.0,46.81,33.33,Silver,20,90.05235602094241,16,40.0
QwQ-32B,240.0,6,240.0,46.81,33.33,Silver,20,90.05235602094241,15,40.0
gpt-oss-120b-high,229.0,6,229.0,47.14,33.33,Silver,22,89.00523560209425,17,38.17
gpt-oss-120b_sp2,200.0,6,200.0,41.97,33.33,Silver,41,79.05759162303664,25,33.33
gpt-oss-120b-low,200.0,6,200.0,35.21,33.33,Silver,28,85.86387434554973,18,33.33
deepseek-chat,200.0,6,200.0,37.25,33.33,Silver,28,85.86387434554973,19,33.33
Qwen3-8B,191.0,6,191.0,47.54,16.67,Silver,28,85.86387434554973,20,31.83
seed-oss_2048,140.0,6,140.0,24.91,16.67,Silver,33,83.24607329842932,21,23.33
seed-oss_1024,140.0,6,140.0,26.03,16.67,Silver,33,83.24607329842932,22,23.33
seed-oss_4096,140.0,6,140.0,23.75,16.67,Silver,33,83.24607329842932,23,23.33
DeepSeek-R1-Distill-Llama-70B,140.0,6,140.0,30.66,16.67,Silver,33,83.24607329842932,24,23.33
DeepSeek-R1-Distill-Qwen-14B,130.0,6,130.0,42.26,0.0,Silver,36,81.67539267015707,25,21.67
DeepSeek-R1-Distill-Qwen-32B,130.0,6,130.0,43.3,0.0,Silver,36,81.67539267015707,26,21.67
Qwen3-4B,100.0,6,100.0,30.36,0.0,Silver,47,75.91623036649214,27,16.67
Llama-3.3-70B-Instruct,60.0,6,60.0,27.8,0.0,Bronze,71,63.35078534031414,28,10.0
Mistral-Small-3.1-24B-2503,40.0,6,40.0,17.18,0.0,Bronze,81,58.1151832460733,30,6.67
Qwen3-8B-Non-Thinking,40.0,6,40.0,18.74,0.0,Bronze,81,58.1151832460733,29,6.67
Qwen3-30B-Non-Thinking,40.0,6,40.0,17.18,0.0,Bronze,81,58.1151832460733,31,6.67
Qwen3-32B-Non-Thinking,40.0,6,40.0,19.01,0.0,Bronze,81,58.1151832460733,32,6.67
Qwen2.5-Coder-32B-Instruct,40.0,6,40.0,18.91,0.0,Bronze,81,58.1151832460733,33,6.67
Llama-3.1-8B-Instruct,5.0,6,5.0,14.1,0.0,None,121,37.17277486910995,34,0.83
seed-oss_512,0.0,6,0.0,0.0,0.0,None,123,36.12565445026178,35,0.0
Qwen3-30B,0.0,6,0.0,0.0,0.0,None,123,36.12565445026178,36,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,123,36.12565445026178,37,0.0
Qwen3-4B-Non-Thinking,0.0,6,0.0,0.52,0.0,None,123,36.12565445026178,40,0.0
seed-oss_0,0.0,6,0.0,2.39,0.0,None,123,36.12565445026178,39,0.0
Qwen3-14B-Non-Thinking,0.0,6,0.0,5.15,0.0,None,123,36.12565445026178,41,0.0
Qwen2.5-Coder-14B-Instruct,0.0,6,0.0,2.87,0.0,None,123,36.12565445026178,42,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,0.0,0.0,None,123,36.12565445026178,38,0.0
Qwen2.5-Coder-7B-Instruct,0.0,6,0.0,2.39,0.0,None,123,36.12565445026178,44,0.0
Codestral-22B-v0.1,0.0,6,0.0,2.45,0.0,None,123,36.12565445026178,45,0.0
Mistral-Large-Instruct-2411,0.0,6,0.0,8.79,0.0,None,123,36.12565445026178,46,0.0
Llama-4-Scout,0.0,6,0.0,2.39,0.0,None,123,36.12565445026178,43,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,6,0.0,0.72,0.0,None,123,36.12565445026178,47,0.0
Qwen2.5-72B,0.0,6,0.0,7.7,0.0,None,123,36.12565445026178,48,0.0
