Model_Name,Benign_Injection_Rate,threshold,F1,Accuracy,Precision,Recall,TP,TN,FP,FN,Too_Early,Too_Late,Never_Triggered,Benign_Flagged,Total_Tasks,Total_Cost,Data
o3-mini,0.0,0.5,0.9444444444444444,0.9473684210526315,1.0,0.8947368421052632,85,95,0,10,0,0,10,0,190,1.2926661,../data/image_tasks/test_data.json
