# ETR_eval_supplementals

This folder contains supplemental files for the article "Stronger Language Models Produce More
Human-Like Errors".

## Important Files and Folders

- **ETR_evals.ipynb**  
  Jupyter Notebook for loading results and performing our analysis.

- **results1-21.csv**, **results22-36.csv**, **results37-40.csv**, **results-reversed-1-39.csv**  
  CSV files containing the raw results for model evaluations on ETR questions, including both standard and reversed-premise problems.

- **etr_case_generator**  
  Code used to generate the problems. See **etr_case_generator/README.md**.

- **etr_case_generator/datasets/largeset_383.jsonl**  
  The 383 problems used for our experiments.