Model,Total Score,Included Tasks,Included Score,Avg Tests Passed,Pass Rate (%),Medal,Human Relative Rank,Human Percentile,Rank,Relative Score (%)
grok-4-fast-reasoning,198.0,6,198.0,58.1,16.67,None,214,41.80327868852459,1,33.0
gpt-oss-120b-high,167.0,6,167.0,49.53,16.67,None,256,30.327868852459016,1,27.83
gpt-5,160.0,6,160.0,44.24,16.67,None,262,28.688524590163933,2,26.67
gemini-2.5-pro,134.0,6,134.0,50.02,0.0,None,286,22.131147540983605,3,22.33
seed-oss_-1,122.0,6,122.0,47.3,0.0,None,295,19.672131147540984,5,20.33
deepseek-reasoner,120.0,6,120.0,44.63,0.0,None,295,19.672131147540984,6,20.0
gpt-oss-120b-medium,119.0,6,119.0,41.63,0.0,None,295,19.672131147540984,7,19.83
gemini-2.5-flash,112.0,6,112.0,39.91,0.0,None,300,18.306010928961747,8,18.67
gpt-oss-120b_sp1,112.0,6,112.0,38.49,0.0,None,300,18.306010928961747,9,18.67
Qwen3-14B,102.0,6,102.0,32.42,0.0,None,305,16.939890710382514,11,17.0
gpt-oss-120b_sp3,102.0,6,102.0,35.25,0.0,None,305,16.939890710382514,11,17.0
OlympicCoder-32B,99.0,6,99.0,32.14,0.0,None,308,16.120218579234972,12,16.5
gpt-oss-120b_sp2,97.0,6,97.0,32.03,0.0,None,309,15.846994535519126,13,16.17
gpt-oss-20b-low,97.0,6,97.0,32.21,0.0,None,309,15.846994535519126,12,16.17
Qwen3-8B,92.0,6,92.0,28.7,0.0,None,317,13.66120218579235,14,15.33
gpt-oss-120b-low,90.0,6,90.0,29.59,0.0,None,317,13.66120218579235,15,15.0
gpt-oss-120b_sp4,90.0,6,90.0,28.96,0.0,None,317,13.66120218579235,17,15.0
gpt-oss-20b-medium,87.0,6,87.0,31.47,0.0,None,321,12.568306010928962,17,14.5
gpt-oss-20b-high,87.0,6,87.0,31.96,0.0,None,321,12.568306010928962,16,14.5
claude-sonnet-4.5,77.0,6,77.0,36.69,0.0,None,332,9.562841530054644,20,12.83
gpt-o3-mini-high,67.0,6,67.0,19.0,0.0,None,340,7.377049180327869,18,11.17
DeepSeek-R1-Distill-Llama-70B,62.0,6,62.0,31.53,0.0,None,342,6.830601092896175,19,10.33
claude-haiku-4.5,58.0,6,58.0,28.37,0.0,None,344,6.284153005464481,23,9.67
Qwen3-4B,45.0,6,45.0,20.89,0.0,None,348,5.191256830601093,20,7.5
gpt-4.1,42.0,6,42.0,20.06,0.0,None,350,4.644808743169399,21,7.0
Qwen3-8B-Non-Thinking,42.0,6,42.0,15.82,0.0,None,350,4.644808743169399,23,7.0
Qwen3-30B-Non-Thinking,42.0,6,42.0,15.82,0.0,None,350,4.644808743169399,24,7.0
deepseek-chat,37.0,6,37.0,15.98,0.0,None,352,4.098360655737705,26,6.17
Llama-4-Scout,29.0,6,29.0,15.29,0.0,None,354,3.551912568306011,29,4.83
Qwen3-14B-Non-Thinking,22.0,6,22.0,12.59,0.0,None,355,3.278688524590164,37,3.67
Qwen3-32B-Non-Thinking,25.0,6,25.0,12.81,0.0,None,354,3.551912568306011,32,4.17
Qwen2.5-Coder-14B-Instruct,25.0,6,25.0,14.74,0.0,None,354,3.551912568306011,33,4.17
Mistral-Small-3.1-24B-2503,22.0,6,22.0,12.02,0.0,None,355,3.278688524590164,34,3.67
Qwen3-4B-Non-Thinking,22.0,6,22.0,11.94,0.0,None,355,3.278688524590164,36,3.67
DeepSeek-Coder-V2-Lite-Instruct,22.0,6,22.0,12.16,0.0,None,355,3.278688524590164,35,3.67
Mistral-Large-Instruct-2411,22.0,6,22.0,14.27,0.0,None,355,3.278688524590164,38,3.67
OlympicCoder-7B,17.0,6,17.0,8.91,0.0,None,356,3.0054644808743167,37,2.83
Qwen2.5-Coder-7B-Instruct,12.0,6,12.0,10.97,0.0,None,359,2.185792349726776,41,2.0
Llama-3.1-8B-Instruct,12.0,6,12.0,9.68,0.0,None,359,2.185792349726776,42,2.0
OpenCodeReasoning-Nemotron-32B-IOI,10.0,6,10.0,6.82,0.0,None,359,2.185792349726776,40,1.67
Qwen2.5-Coder-32B-Instruct,32.0,6,32.0,20.23,0.0,None,353,3.8251366120218577,28,5.33
QwQ-32B,33.0,6,33.0,14.78,0.0,None,353,3.8251366120218577,27,5.5
DeepSeek-R1-Distill-Qwen-14B,20.0,6,20.0,9.26,0.0,None,356,3.0054644808743167,40,3.33
DeepSeek-R1-Distill-Qwen-32B,27.0,6,27.0,11.22,0.0,None,354,3.551912568306011,30,4.5
DeepSeek-R1-Distill-Qwen-7B,0.0,6,0.0,0.0,0.0,None,363,1.092896174863388,46,0.0
DeepSeek-R1-Distill-Llama-8B,0.0,6,0.0,1.36,0.0,None,363,1.092896174863388,47,0.0
Codestral-22B-v0.1,0.0,6,0.0,2.67,0.0,None,363,1.092896174863388,48,0.0
