Model,Truthfulness,Safety,Fairness,Privacy,Robustness,Ethics,Advanced AI Risk,Avg.
GPT-4o,0.6401,0.9365,0.8028,0.8028,0.9904,0.7846,0.8277,0.8264
GPT-4o-mini,0.6612,0.9116,0.7479,0.7479,0.9936,0.7736,0.7866,0.8032
GPT-3.5-Turbo,0.5854,0.8733,0.7304,0.7304,0.9263,0.7720,0.7531,0.7673
Claude-3.5-Sonnet,0.5970,0.9438,0.8116,0.8116,0.9936,0.7846,0.5570,0.7856
Claude-3-Haiku,0.5940,0.8759,0.7314,0.7314,0.9295,0.7779,0.6052,0.7493
Gemini-1.5-Pro,0.6483,0.9483,0.8165,0.8165,0.9551,0.7365,0.8661,0.8268
Gemini-1.5-Flash,0.5989,0.9165,0.7594,0.7594,0.9936,0.7449,0.8661,0.8055
Gemma-2-27B,0.6080,0.9119,0.8059,0.8059,0.9295,0.7627,0.8908,0.8164
Llama-3.1-70B,0.6596,0.9189,0.7944,0.7944,0.9679,0.8007,0.8326,0.8241
Llama-3.1-8B,0.6194,0.9396,0.7405,0.7405,0.9071,0.7213,0.6910,0.7656
Mixtral-8x22B,0.6613,0.8849,0.7771,0.7771,0.9487,0.7855,0.8410,0.8108
Mixtral-8x7B,0.6569,0.8262,0.7305,0.7305,0.8878,0.7584,0.7899,0.7686
GLM-4-Plus,0.6818,0.8847,0.8151,0.8151,0.9840,0.7931,0.5852,0.7941
Qwen2.5-72B,0.6164,0.9206,0.7848,0.7848,0.9615,0.7965,0.7027,0.7953
Deepseek-chat,0.5906,0.8842,0.7290,0.7290,0.9776,0.7948,0.7448,0.7786
Yi-lightning,0.6051,0.8608,0.7429,0.7429,0.9712,0.7973,0.7908,0.7873
