## Code for LLM experiments

### pretraining
Our setup follows standard GPT-2-style model pretraining, using the Megatron-DeepSpeed repository.
For continuing training along different weight directions from given checkpoints,
we provide only the reference scripts `reorder_v01_step02_4x7x4000.sh` and `reorder_v01_step03_4x21x4000.sh`.

### eval
Hessian-vector-vector products are computed in `calc_hvvp.py`.
The results of these calculations, together with the perplexity results,
are converted into tables and figures in the `plots_and_figures_llm.ipynb` notebook.
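
For readers unfamiliar with the quantity, the sketch below illustrates what a Hessian-vector-vector product is, assuming it means the scalar `u^T H v`, where `H` is the Hessian of the loss with respect to the weights. It is not the code from `calc_hvvp.py`: it swaps in a plain central finite-difference approximation on a toy quadratic loss, and the names `hvvp_fd` and `quadratic` are hypothetical.

```python
def hvvp_fd(loss, w, u, v, eps=1e-3):
    """Approximate u^T H v by central finite differences.

    H is the Hessian of `loss` at the point `w`. Illustrative only;
    this is an assumption, not the method used in calc_hvvp.py.
    """
    def at(a, b):
        # Evaluate the loss at w + a*eps*u + b*eps*v.
        shifted = [wi + a * eps * ui + b * eps * vi
                   for wi, ui, vi in zip(w, u, v)]
        return loss(shifted)

    # Four-point stencil for the mixed second directional derivative.
    return (at(1, 1) - at(1, -1) - at(-1, 1) + at(-1, -1)) / (4 * eps ** 2)


def quadratic(w):
    # Toy loss 0.5 * w^T A w, whose Hessian is exactly A.
    A = [[2.0, 1.0], [1.0, 3.0]]
    return 0.5 * sum(w[i] * A[i][j] * w[j]
                     for i in range(2) for j in range(2))


w0 = [0.3, -0.7]
# Two different directions: picks out the off-diagonal entry A[0][1].
mixed = hvvp_fd(quadratic, w0, [1.0, 0.0], [0.0, 1.0])
# Same direction twice: curvature v^T A v along v = (1, 1).
curv = hvvp_fd(quadratic, w0, [1.0, 1.0], [1.0, 1.0])
```

For a real model, each finite-difference evaluation costs a full forward pass, so automatic differentiation is usually the more practical route; the sketch is only meant to pin down the definition.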

