To run the code, first install the library locally with `pip install .`,
then install the dependencies:
```
pip install matplotlib numpy scipy sentencepiece datasets autograd
```
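
To confirm that the install step worked before running any script, a quick import check can help. The following is only a sketch: it assumes each package's import name matches its pip name (true for the six packages listed), and `missing_packages` is a hypothetical helper, not part of this repository.

```python
from importlib.util import find_spec

# Package names taken from the pip command above; the import names are
# assumed to be identical to the pip names.
REQUIRED = ["matplotlib", "numpy", "scipy", "sentencepiece", "datasets", "autograd"]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed dependencies are importable.")
```

An empty result means every listed dependency can be located; it does not check versions.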

The scripts to generate the figures are in `scripts/`.
The real-data scripts require downloading OpenWebText, training tokenizers and computing frequencies.
This can take some time and quite a bit of hard drive space (~100GB).
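
Given the disk-space estimate, it may be worth checking free space before starting the download. This is a minimal sketch using only the standard library; the 100 GB threshold is the rough figure quoted above, and `free_gb` and `has_enough_space` are hypothetical helpers that do not exist in `scripts/`.

```python
import shutil

# Rough requirement quoted in this README; treat it as an estimate.
REQUIRED_GB = 100


def free_gb(path="."):
    """Free space, in decimal gigabytes, on the filesystem containing `path`."""
    return shutil.disk_usage(path).free / 1e9


def has_enough_space(path=".", required_gb=REQUIRED_GB):
    """True if the filesystem containing `path` has at least `required_gb` free."""
    return free_gb(path) >= required_gb


if __name__ == "__main__":
    print(f"Free space: {free_gb():.1f} GB")
    if not has_enough_space():
        print(f"Warning: less than {REQUIRED_GB} GB free.")
```

Pass the directory where the data will actually be stored as `path`, since it may sit on a different filesystem than the repository.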

- Figure 1: `bad_scaling.py`
- Figure 2: `plot-real-data.py` (`download_owt.py` has to be run first)
- Figure 3: `download_owt.py`
- Figure 4: `gd-plots.py`
- Figure 5: `illustration-sign-assumption.py`
- Figure 6: `sign-plots.py`
- Figure 7: `rate_comp.py`