# Supplementary Code for Rare Event Analysis of Large Language Models

We provide minimal demo code for the sampling and analysis methodologies discussed in the paper "Rare Event Analysis of Large Language Models". 

The notebook `sampling_examples.ipynb` provides code to sample the biased ARI and Log-Probs scores from the model using Annealed TPS, as well as to sample IID unbiased scores. The final two cells of this notebook will save data to `/data` that can be analysed in the second notebook, `reweighting_example.ipynb`. This notebook provides code for applying burnin, computing the Gelman Rubin Statistic, MBAR and bootstrapping. 

Within both notebooks there is commented out code providing the parameters used within the original model. However, it should be noted that the code provided is a minimal recreation of the code used, and as such, will be much less efficient due to lack of batching, high performance computing integration, etc. 

Finally, we provide json files in the directiory `/data/paper` that contain the data used to generate the ARI Cumulative Trajectories (Figure 3) and histograms (Figure 4). This can be analysed in `reweighting_example.ipynb` by setting `USE_EXAMPLE_DATA = FALSE` in the second cell of the notebook.

# Environment Setup

First, make sure [uv](https://docs.astral.sh/uv/) is installed by running `curl -LsSf https://astral.sh/uv/install.sh | sh` (use `source $HOME/.local/bin/env` to add it to the path without restarting the terminal). Next, run `uv sync` in the folder with the pyproject.toml file to create a virtual environment with the required dependencies.