Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
gpt-5,517.0,6,517.0,92.4,66.67,Gold,4,98.02631578947368,1,86.17
gpt-oss-120b-high,459.0,6,459.0,86.13,66.67,Gold,6,96.71052631578948,2,76.5
grok-4-fast-reasoning,410.0,6,410.0,83.77,50.0,Gold,11,93.42105263157895,3,68.33
gemini-2.5-pro,250.0,6,250.0,58.89,16.67,Silver,35,77.63157894736842,4,41.67
gpt-oss-20b-high,304.0,6,304.0,74.86,40.0,Silver,26,83.55263157894737,3,50.67
gpt-oss-120b_sp3,265.0,6,265.0,61.8,33.33,Silver,33,78.94736842105263,6,44.17
gpt-oss-120b_sp1,251.0,6,251.0,53.23,33.33,Silver,34,78.28947368421052,7,41.83
gpt-oss-120b-medium,240.0,6,240.0,52.39,33.33,Bronze,40,74.34210526315789,5,40.0
gpt-o3-mini-high,240.0,6,240.0,52.67,33.33,Bronze,40,74.34210526315789,6,40.0
gpt-oss-120b_sp4,240.0,6,240.0,52.39,33.33,Bronze,40,74.34210526315789,10,40.0
gpt-oss-120b_sp2,181.0,6,181.0,48.83,16.67,Bronze,57,63.1578947368421,11,30.17
gemini-2.5-flash,214.0,6,214.0,56.79,16.67,Bronze,51,67.10526315789474,7,35.67
gpt-oss-20b-medium,175.0,6,175.0,46.02,16.67,Bronze,60,61.18421052631579,8,29.17
seed-oss_-1,161.0,6,161.0,40.95,16.67,Bronze,66,57.23684210526316,9,26.83
gpt-oss-120b-low,159.0,6,159.0,43.57,16.67,Bronze,67,56.578947368421055,10,26.5
claude-sonnet-4.5,154.0,6,154.0,56.98,0.0,Bronze,69,55.26315789473684,16,25.67
seed-oss_16384,113.0,6,113.0,46.12,0.0,None,85,44.73684210526316,11,18.83
Qwen3-32B,85.0,6,85.0,39.73,0.0,None,106,30.92105263157895,12,14.17
seed-oss_8192,76.0,6,76.0,29.43,0.0,None,110,28.289473684210527,13,12.67
gpt-4.1,9.0,6,9.0,7.09,0.0,None,144,5.921052631578948,17,1.5
deepseek-reasoner,68.0,6,68.0,32.08,0.0,None,113,26.31578947368421,14,11.33
gpt-oss-20b-low,64.0,6,64.0,21.78,0.0,None,115,25.0,15,10.67
Qwen3-30B,0.0,6,0.0,0.79,0.0,None,148,3.289473684210526,23,0.0
DeepSeek-R1-Distill-Qwen-32B,9.0,6,9.0,5.78,0.0,None,144,5.921052631578948,18,1.5
OpenCodeReasoning-Nemotron-32B-IOI,37.0,6,37.0,17.03,0.0,None,132,13.81578947368421,25,6.17
QwQ-32B,6.0,6,6.0,4.4,0.0,None,148,3.289473684210526,20,1.0
Qwen3-8B-Non-Thinking,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,30,0.0
Qwen2.5-72B,0.0,6,0.0,3.03,0.0,None,148,3.289473684210526,45,0.0
deepseek-chat,22.0,6,22.0,15.93,0.0,None,139,9.210526315789474,16,3.67
Qwen3-14B,0.0,6,0.0,0.79,0.0,None,148,3.289473684210526,26,0.0
OlympicCoder-7B,11.0,6,11.0,15.77,0.0,None,143,6.578947368421052,31,1.83
Llama-3.3-70B-Instruct,0.0,6,0.0,0.79,0.0,None,148,3.289473684210526,40,0.0
DeepSeek-R1-Distill-Llama-70B,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,35,0.0
seed-oss_0,6.0,6,6.0,8.33,0.0,None,148,3.289473684210526,19,1.0
Qwen3-4B-Non-Thinking,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,34,0.0
Qwen3-30B-Non-Thinking,0.0,6,0.0,3.82,0.0,None,148,3.289473684210526,46,0.0
Qwen3-8B,4.0,6,4.0,11.33,0.0,None,148,3.289473684210526,21,0.67
Qwen3-4B,4.0,6,4.0,8.24,0.0,None,148,3.289473684210526,22,0.67
DeepSeek-R1-Distill-Qwen-14B,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,24,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,29,0.0
Codestral-22B-v0.1,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,31,0.0
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,36,0.0
seed-oss_1024,0.0,6,0.0,3.17,0.0,None,148,3.289473684210526,37,0.0
Llama-4-Scout,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,38,0.0
Llama-3.1-8B-Instruct,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,25,0.0
Qwen3-14B-Non-Thinking,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,33,0.0
seed-oss_512,0.0,6,0.0,1.33,0.0,None,148,3.289473684210526,41,0.0
seed-oss_2048,0.0,6,0.0,2.86,0.0,None,148,3.289473684210526,43,0.0
seed-oss_4096,0.0,6,0.0,6.73,0.0,None,148,3.289473684210526,44,0.0
Qwen3-32B-Non-Thinking,0.0,6,0.0,9.79,0.0,None,148,3.289473684210526,42,0.0
Qwen2.5-Coder-7B-Instruct,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,27,0.0
Mistral-Large-Instruct-2411,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,32,0.0
Qwen2.5-Coder-32B-Instruct,0.0,6,0.0,3.03,0.0,None,148,3.289473684210526,48,0.0
DeepSeek-Coder-V2-Lite-Instruct,0.0,6,0.0,1.33,0.0,None,148,3.289473684210526,47,0.0
Mistral-Small-3.1-24B-2503,0.0,6,0.0,2.33,0.0,None,148,3.289473684210526,39,0.0
Qwen2.5-Coder-14B-Instruct,0.0,6,0.0,0.0,0.0,None,148,3.289473684210526,28,0.0
