Model_Name,Benign_Injection_Rate,Total_Cost,Data,AUROC,best_threshold,best_F1,best_accuracy,best_precision,best_recall,best_TP,best_TN,best_FP,best_FN,best_too_late_count,best_never_triggered_count,best_benign_but_flagged_as_harmful_count,threshold_0_5_F1,threshold_0_5_accuracy,threshold_0_5_precision,threshold_0_5_recall,threshold_0_5_TP,threshold_0_5_TN,threshold_0_5_FP,threshold_0_5_FN,threshold_0_5_too_late_count,threshold_0_5_never_triggered_count,threshold_0_5_benign_but_flagged_as_harmful_count
gpt-4o,0.0,17.3897,../data/image_tasks/test_data.json,0.9942382271468144,0.30000000000000004,0.9502762430939227,0.9526315789473684,1.0,0.9052631578947369,86,95,0,9,0,9,0,0.888888888888889,0.9,1.0,0.8,76,95,0,19,0,19,0
