Model,feedback,correction,comparison,overall,opensourced,link
GPT-4-turbo,7.84,7.69,8.04,7.86,FALSE,https://openai.com/blog/new-models-and-developer-products-announced-at-devday
GLM4-no-tool,7.49,8.10,6.8,7.46,FALSE,https://zhipuai.cn/devday
Qwen-Max,6.65,8.21,6.55,7.14,FALSE,https://dashscope.aliyun.com/
ErnieBot Pro,6.31,7.98,5.88,6.72,FALSE,-
Claude-instant-1,5.88,7.72,5.76,6.45,FALSE,https://www.anthropic.com/product
Baichuan2 Turbo,5.54,7.65,4.90,6.03,FALSE,https://www.baichuan-ai.com/home
GPT-3.5-turbo,5.21,7.55,4.92,5.89,FALSE,https://openai.com/blog/new-models-and-developer-products-announced-at-devday
Gemini-Pro,4.94,7.49,4.29,5.57,FALSE,https://deepmind.google/technologies/gemini/
MiniMax-abab5,4.77,6.81,4.19,5.26,FALSE,https://api.minimax.chat/document/guides/chat-pro?id=64b79fa3e74cddc5215939f4
PaLM,3.8,6.09,3.87,4.59,FALSE,https://developers.googleblog.com/2023/03/announcing-palm-api-and-makersuite.html
Qwen-72B-Chat,5.57,7.45,5.02,6.01,TRUE,https://huggingface.co/Qwen/Qwen-72B-Chat
DeepSeek-67B,5.53,7.30,4.69,5.84,TRUE,https://huggingface.co/deepseek-ai/deepseek-llm-67b-chat
Mixtral-8x7B,5.31,7.33,4.62,5.75,TRUE,https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
WizardLM-70B-v1.0,3.76,5.37,3.36,4.16,TRUE,https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
Llama2-70B-Chat,4.12,7.11,3.95,5.00,TRUE,https://huggingface.co/meta-llama/Llama-2-70b-chat-hf
Auto-J-13B,4.21,0,4.63,4.42,TRUE,https://huggingface.co/GAIR/autoj-13b
UltraCM-13B,4.12,0,0,4.12,TRUE,https://huggingface.co/openbmb/UltraCM-13b
InternLM2-20B-Chat,6.03,7.48,5.1,6.20,TRUE,https://huggingface.co/internlm/internlm2-chat-20b
Qwen-14B-Chat,4.81,7.25,3.98,5.35,TRUE,https://huggingface.co/Qwen/Qwen-14B-Chat
Vicuna-33B-v1.3,3.82,6.93,3.95,4.90,TRUE,https://huggingface.co/lmsys/vicuna-33b-v1.3
Baichuan2-13B,3.23,6.8,3.49,4.51,TRUE,https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat
Yi-34B-Chat,3.58,6.25,3.35,4.39,TRUE,https://huggingface.co/01-ai/Yi-34B-Chat
WizardLM-13B-v1.2,3.50,6.43,3.16,4.36,TRUE,https://huggingface.co/WizardLM/WizardLM-13B-V1.2
Llama2-13B-Chat,3.70,7.11,3.32,4.92,TRUE,https://huggingface.co/meta-llama/Llama-2-13b-chat-hf
InternLM2-7B-Chat,5.20,7.17,4.62,5.66,TRUE,https://huggingface.co/internlm/internlm2-chat-7b
Mistral-7B-ins-v0.2,4.70,7.2,4.28,5.39,TRUE,https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2
Qwen-7B-Chat,4.05,6.38,3.47,4.63,TRUE,https://huggingface.co/Qwen/Qwen-7B-Chat
DeepSeek-7B,3.44,6.06,3.6,4.37,TRUE,https://huggingface.co/deepseek-ai/deepseek-llm-7b-chat
Vicuna-7B-v1.3,3.82,5.61,2.98,4.14,TRUE,https://huggingface.co/lmsys/vicuna-7b-v1.3
Baichuan2-7B-Chat,3.74,5.48,3.1,4.11,TRUE,https://huggingface.co/baichuan-inc/Baichuan2-7B-Chat
ChatGLM3-6B,3.73,5.09,3.03,3.95,TRUE,https://huggingface.co/THUDM/chatglm3-6b
Yi-6B-Chat,2.8,4.35,2.39,3.18,TRU,https://huggingface.co/01-ai/Yi-6B-Chat
Llama2-7B-Chat,3.44,6.26,3.21,4.30,TRUE,https://huggingface.co/meta-llama/Llama-2-7b-chat-hf