# Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

We provide the code to run our simulations and predictions. The description of the input parameters and outputs are given in the programs.

Requirements :
* python 3.11.7
* numpy 2.3.1
* scipy 1.16.0
* pytorch 2.7.0

## Population loss

**Bayes risk**

`python BO_pop.py nu Lmax Lrandom model`

**Optimal population risk of the attention**

`python att_pop.py nu Lmax Lrandom model sigmaT`

## Finite sample complexity

**Bayes-optimal test risk for max-SLR**

`python BO_maxSLR.py alpha L initI`

**Bayes-optimal test risk for spiked-SLR**

`python BO_spikedSLR.py alpha nu L initI`

**Test risk of the attention**

`python att.py alpha nu L model rv rk sigmaT initI`

**Simulation of the attention**

`python att_empirical.py D alpha nu L Lrandom model rv rk sigmaT initI`