-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.048
Overall API Call Match Accuracy: 0.032
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.091
Overall API Call Match Accuracy: 0.070
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.105
Overall API Call Match Accuracy: 0.057
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.616
Overall API Call Match Accuracy: 0.534
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.540
Overall API Call Match Accuracy: 0.484
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_10000 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.562
Overall API Call Match Accuracy: 0.496
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.144
Overall API Call Match Accuracy: 0.103
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.159
Overall API Call Match Accuracy: 0.133
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.142
Overall API Call Match Accuracy: 0.089
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.635
Overall API Call Match Accuracy: 0.555
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.568
Overall API Call Match Accuracy: 0.514
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_20000 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.561
Overall API Call Match Accuracy: 0.491
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.251
Overall API Call Match Accuracy: 0.192
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.212
Overall API Call Match Accuracy: 0.180
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.182
Overall API Call Match Accuracy: 0.135
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.648
Overall API Call Match Accuracy: 0.571
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.554
Overall API Call Match Accuracy: 0.498
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_40000 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.590
Overall API Call Match Accuracy: 0.527
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.295
Overall API Call Match Accuracy: 0.231
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.242
Overall API Call Match Accuracy: 0.201
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.229
Overall API Call Match Accuracy: 0.195
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.640
Overall API Call Match Accuracy: 0.565
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.573
Overall API Call Match Accuracy: 0.514
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_80000 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.612
Overall API Call Match Accuracy: 0.545
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.349
Overall API Call Match Accuracy: 0.272
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.283
Overall API Call Match Accuracy: 0.232
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.230
Overall API Call Match Accuracy: 0.201
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.654
Overall API Call Match Accuracy: 0.576
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.594
Overall API Call Match Accuracy: 0.533
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_98914 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.611
Overall API Call Match Accuracy: 0.548
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_1.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.772
Overall API Call Match Accuracy: 0.695
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_2.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.538
Overall API Call Match Accuracy: 0.473
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.448
Overall API Call Match Accuracy: 0.402
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_1_retrieval_IC_3.json.json
Number of testing APIs: 54
Overall Endpoint Match Accuracy: 0.744
Overall API Call Match Accuracy: 0.693
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_2_retrieval_IC_3.json.json
Number of testing APIs: 900
Overall Endpoint Match Accuracy: 0.573
Overall API Call Match Accuracy: 0.519
-------------------------------------------------
<<model>>: CodeLlama-13b-hf <<train data>>: cleaned_python/tokenized_combined_data_1000000 <<test data>>: total_testing_cleaned_python_level_3_retrieval_IC_3.json.json
Number of testing APIs: 116
Overall Endpoint Match Accuracy: 0.640
Overall API Call Match Accuracy: 0.561