# MixEval


## Install
1. `cd MixEval`
2. `conda create -n MixEval python=3.11 --yes`
3. `conda activate MixEval`
4. `bash setup.sh`

## Evaluate
1. `bash mix_eval/eval_scripts/run_evaluate_8A10080G.sh`
2. **After all evaluations finish, inspect both of the following to confirm that the results are complete:**
    - The error logs:
        - `mix_eval/data/model_responses_8A10080G/close_freeform/error.log`
        - `mix_eval/data/model_responses_8A10080G/close_multichoice/error.log`
        - `mix_eval/data/model_responses_8A10080G/open/error.log`

    - The final check of the eval results:
        - `mix_eval/data/model_responses_8A10080G/eval_checks.log`
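
The log inspection above can be partly automated. The following is a minimal sketch and not part of MixEval: the helper names `summarize_log` and `report` are hypothetical, and it assumes the script is run from the repository root. It only reports whether each log is missing, empty, or has content; it does not interpret the log contents, since their format is not described here.

```python
from pathlib import Path

# Log paths copied from the list above, relative to the repository root.
RESPONSES_DIR = Path("mix_eval/data/model_responses_8A10080G")
ERROR_LOGS = [
    RESPONSES_DIR / "close_freeform" / "error.log",
    RESPONSES_DIR / "close_multichoice" / "error.log",
    RESPONSES_DIR / "open" / "error.log",
]
CHECKS_LOG = RESPONSES_DIR / "eval_checks.log"


def summarize_log(path: Path) -> str:
    """Return 'missing', 'empty', or 'N non-blank line(s)' for one log file."""
    if not path.is_file():
        return "missing"
    text = path.read_text(errors="replace")
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return "empty"
    return f"{len(lines)} non-blank line(s)"


def report(root: Path = Path(".")) -> dict[str, str]:
    """Summarize all four logs under `root` (hypothetical helper)."""
    return {str(p): summarize_log(root / p) for p in [*ERROR_LOGS, CHECKS_LOG]}


if __name__ == "__main__":
    for name, status in report().items():
        print(f"{name}: {status}")
```

Any log reported as having content, or as missing, still needs to be opened and read by hand; the script only shows where to look first.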