# Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors
## Supplementary code

This archive contains the code used for the experiments presented in the paper.

`train.py` and the `src/` folder provide the core classes and methods to run the experiments in Section 4 (Experiments with Small Transformers). We use Weights & Biases Sweeps to run these experiments. The configuration files are in the `conf/` and `sweeps/` folders. To reproduce the figures, download the experimental results from Weights & Biases cloud storage. We do not include the visualization code here.

Additionally, `sae_analyze.py` and the `sae_ipynb/` folder contain the remaining code to reproduce Section 5 (Investigating Features in LLMs).

To run the code, you need Python 3 and the following packages:
 - datasets
 - editdistance
 - hydra-core
 - matplotlib
 - numpy
 - omegaconf
 - packaging
 - requests
 - sae-lens
 - scikit-learn
 - scipy
 - spacy
 - torch
 - tqdm
 - transformer-lens
 - transformers
 - wandb