Model_Name,Benign_Injection_Rate,threshold,F1,Accuracy,Precision,Recall,TP,TN,FP,FN,Too_Early,Too_Late,Never_Triggered,Benign_Flagged,Total_Tasks,Total_Cost,Data
o3-mini,1.0,0.5,0.5907172995780591,0.7095808383233533,1.0,0.41916167664670656,70,167,0,97,0,40,57,0,334,3.686190199999995,../data/decomposed_queries/test_data.json
