This README is an anonymized version of the original.

All model outputs (excluding those of PaLM) are given in `model_outputs_ood.zip`.
To analyze the model output for a single experiment, run `python analyze_results.py <filename>`.
To generate new examples, run `python run_experiment.py ...`. The generated examples for our experiments are given in `generated_data.zip`.
To produce all figures in the manuscript, run `python make_plots.py`.
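
Since `analyze_results.py` takes one file at a time, a small wrapper can unpack `model_outputs_ood.zip` and analyze each file in turn. The following is only a sketch under assumptions not stated in this README: that every file in the archive is an experiment output, that `analyze_results.py` is in the current directory, and that extracting into a directory named `model_outputs_ood` is acceptable. The helper names `extract_outputs` and `analyze_all` are hypothetical.

```python
import subprocess
import sys
import zipfile
from pathlib import Path


def extract_outputs(archive, dest):
    """Unpack `archive` into `dest` and return every extracted file, sorted."""
    dest = Path(dest)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
    return sorted(p for p in dest.rglob("*") if p.is_file())


def analyze_all(files, script="analyze_results.py"):
    """Run `python analyze_results.py <filename>` once per file."""
    for path in files:
        # check=True stops at the first file the analysis script fails on.
        subprocess.run([sys.executable, script, str(path)], check=True)


if __name__ == "__main__":
    # Assumption: the archive sits in the current working directory.
    archive = Path("model_outputs_ood.zip")
    if archive.exists():
        analyze_all(extract_outputs(archive, "model_outputs_ood"))
    else:
        print(f"{archive} not found")
```

If the archive also holds files that are not experiment outputs, filter the list returned by `extract_outputs` before passing it to `analyze_all`.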
