numpy
tqdm
torch
huggingface-hub
datasets
tiktoken
transformers