torch
transformers
datasets
accelerate>=0.26.0