skill,score,count,avg,normalize
robustness,51.0,19,2.6842105263157894,11.736842105263158
correctness,96.0,50,1.92,8.68
efficiency,47.0,20,2.35,10.4
factuality,110.0,44,2.5,11.0
commonsense,156.0,51,3.0588235294117645,13.235294117647058
comprehension,268.0,99,2.707070707070707,11.828282828282829
insightfulness,39.0,12,3.25,14.0
completeness,80.0,29,2.7586206896551726,12.03448275862069
metacognition,76.0,21,3.619047619047619,15.476190476190476
readability,56.0,14,4.0,17.0
conciseness,67.0,19,3.526315789473684,15.105263157894736
harmlessness,107.0,25,4.28,18.12
