模型名称,安全影响分类,实验结果为true的数据数,实验结果为false的数据数,成功率
claude-4,UDM,33,28,54.1
claude-4,UDR,36,31,53.73
claude-4,DoS,40,35,53.33
claude-4,所有类型汇总,109,94,53.69
deepseek,UDM,26,35,42.62
deepseek,UDR,22,45,32.84
deepseek,DoS,29,46,38.67
deepseek,所有类型汇总,77,126,37.93
gemini-2.5,UDM,37,24,60.66
gemini-2.5,UDR,4,63,5.97
gemini-2.5,DoS,34,41,45.33
gemini-2.5,所有类型汇总,75,128,36.95
gpt-4.1,UDM,45,16,73.77
gpt-4.1,UDR,36,28,56.25
gpt-4.1,DoS,31,44,41.33
gpt-4.1,所有类型汇总,112,88,56.0
qwen,UDM,18,43,29.51
qwen,UDR,18,49,26.87
qwen,DoS,43,32,57.33
qwen,所有类型汇总,79,124,38.92
