
`self_consistency.py` generates entailment matrices from saved LLM responses (see `./answers_nq` for an example of the folder structure). Example usage:
- `python self_consistency.py --gpu 0 --type "labels_logits_and_reps" --dirname "./answers_nq/meta-llama_llama-3-70b-instruct_p_0.9_temp_1.0_samples_100" --trial 0 --subset 10`
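For intuition, an entailment matrix over the sampled responses to one question can be pictured as a square matrix whose entry `(i, j)` scores how strongly response `i` entails response `j`. The sketch below is only an illustration and is not taken from `self_consistency.py`: `toy_entailment_score` and `entailment_matrix` are hypothetical names, and the token-overlap scorer is a stand-in for whatever entailment model the script actually uses.

```python
from typing import Callable, List


def toy_entailment_score(premise: str, hypothesis: str) -> float:
    """Hypothetical stand-in for an entailment model.

    Returns the fraction of hypothesis tokens that also occur in the premise.
    """
    premise_tokens = set(premise.lower().split())
    hypothesis_tokens = hypothesis.lower().split()
    if not hypothesis_tokens:
        return 0.0
    hits = sum(tok in premise_tokens for tok in hypothesis_tokens)
    return hits / len(hypothesis_tokens)


def entailment_matrix(
    responses: List[str],
    score: Callable[[str, str], float] = toy_entailment_score,
) -> List[List[float]]:
    """Entry [i][j] scores whether responses[i] entails responses[j]."""
    return [[score(a, b) for b in responses] for a in responses]


# Assumed toy data: three sampled answers to the same question.
responses = ["paris", "paris france", "london"]
matrix = entailment_matrix(responses)
for row in matrix:
    print(row)
```

The matrix is generally asymmetric: here "paris france" fully covers "paris", but not the other way around.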


`cross_consistency.py` generates cross-model entailment matrices from the saved responses of two LLMs. Example usage:
- `python cross_consistency.py --gpu 0 --type "labels_logits_and_reps" --dirnameA "./answers_nq/meta-llama_llama-3-70b-instruct_p_0.9_temp_1.0_samples_100" --dirnameB "./answers_nq/ibm-mistralai_merlinite-7b_p_0.9_temp_1.0_samples_100" --trial 0 --subset 10`


`two_stage_detection/detector_train_test.py` evaluates the two-stage detection algorithm and saves the results. Example usage:
- `python detector_train_test.py --metric "auroc" --train_type 'both' --dataset "nq" --model_target "mistralai/mixtral-8x7b-instruct-v01" --model_verifier "meta-llama/llama-3-70b-instruct" --n_subset 10 --cross_type "multiple"`
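As a reminder of what `--metric "auroc"` measures, here is a minimal sketch of the standard AUROC definition: the probability that a randomly chosen positive example receives a higher detector score than a randomly chosen negative one, with ties counted as one half. This is not the script's own implementation, which is not shown here; the `auroc` helper and the toy labels and scores are assumptions for illustration.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise AUROC: fraction of (positive, negative) pairs ranked correctly.

    labels: 1 for the positive class, 0 for the negative class.
    scores: detector scores, higher meaning "more likely positive".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a correct ranking
    return wins / (len(pos) * len(neg))


# Assumed toy data: two positives, two negatives.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
result = auroc(labels, scores)
print(result)
```

The pairwise loop is quadratic in the number of examples, which is fine for a sanity check but slower than a rank-based implementation on large evaluation sets.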


`./two_stage_detection/saved_reults_both/analyze_budget.ipynb` plots performance against computational cost.