Structure of the supplementary materials:

 - appendix.pdf: The appendix, containing the percentage of log-likelihood
   improvement from context sliced by POS tag, example visualizations,
   model training steps per second, and the licenses of the assets used in
   this work.
 - visualization/top_1pct: Contains log-likelihood visualizations sampled
   from the context-input-target triplets in the top 1% by context utility.
 - visualization/random: Contains log-likelihood visualizations of
   context-input-target triplets randomly sampled from the C4 train split.
