tiktoken
torch=>=2.9.0
tqdm==4.67.1
fast-hadamard-transform
schedulefree
transformers
wandb
datasets
zstandard
scipy