Codes for ICML Submission #11091 -- The Expressivity Limits of Transformers

## Requirements

- Python 3.7+
- PyTorch 1.9+
- Transformers 4.21+
- NumPy
- Pandas
- scikit-learn
- tqdm
- matplotlib (for visualization)
- seaborn (for visualization)

## Project Structure

```
./
├── cramming/
│   ├── data/
│   │   └──download_texts.sh                # download the necessary datasets for cramming
│   ├── scripts/
│   │   └──run_replicability_adaptive.sh    # run cramming 
│   └── plot_accessibility.ipynb            # show cramming results 
├── download_llms.py                        # download necessary HF LLM for the three exps 
├── utils/                                  # codes to easily launch hf models with hooks  
│   ├── __init__.py                         # and visualize next-token cells 
│   ├── base_llm.py
│   ├── embedding_space_visualizer.py
│   ├── hf_llm.py
│   ├── model_factory.py
│   └── token_partition_analyzer.py
├── models_list.csv                         # constants for each model
├── th_bounds_estimation.ipynb              # notebook to estimate ratios 
├── generate_embeddings.py                  # generate embeddings used to find the supports parameters
├── cell_volume.py                          # estimate the volume distribution given rectangle support  
├── cell_convolution.py                     # estimate the volume distributions of n-grams
└── cell_volume_tests.ipynb                 # exploratory analysis of cell volumes 
```

## License

MIT License
