HelixTrain: Enhancing Long-Context LLM Training via 3D Dynamic Parallelism

10 Nov 2025 (modified: 12 Nov 2025)THU 2025 Fall AML SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Distributed Training, Long Context Training
Abstract: NONE
Submission Number: 42
Loading