,OpenAI,,,Anthropic,,Google,,,Meta,,,Mistral AI,,,Alibaba,,,,DeepSeek AI,
,GPT-3.5-turbo,GPT-4o-mini,GPT-4o,Claude-3-Haiku,Claude-3-Sonnet,Gemini-1.5-Pro,Gemini-1.5-Flash,Gemma-2-9b-it,LlaMA-3.1-8b-Instruct,LlaMA-3.1-70B-Instruct,LlaMA-3.1-405B-Instruct,7B-Instruct,Codestral,Large-V2,Qwen2-1.5B-instruct,Qwen2.5-Coder-7B-Instruct,Qwen2.5-7B-Instruct,Qwen2.5-72B-Instruct,Coder-V2,Chat-V2
Parsable,80.63%,98.13%,99.38%,95.63%,92.50%,95.63%,16.25%,0.00%,0.00%,0.00%,1.88%,0.00%,53.75%,97.50%,0.00%,86.25%,0.00%,93.13%,83.75%,87.50%
Solvable,73.75%,96.25%,97.50%,94.38%,89.38%,95.00%,15.63%,0.00%,0.00%,0.00%,1.88%,0.00%,52.50%,96.25%,0.00%,81.88%,0.00%,91.25%,81.88%,86.25%
Equivalent,25.00%,28.13%,50.63%,38.75%,38.75%,46.88%,9.38%,0.00%,0.00%,0.00%,1.25%,0.00%,20.00%,50.63%,0.00%,29.38%,0.00%,48.13%,31.88%,34.38%
