This repo is for submission #1267 **NormSoftmax: Normalize the Input of Softmax to Accelerate and Stabilize Training** at ICLR 2023.

Run `python train-vit.py` to train a vision transformer on CIFAR10 dataset.