Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training.

Yujie Wang, Shiju Wang, Shenhan Zhu, Fangcheng Fu, Xinyi Liu, Xuefeng Xiao 0001, Huixia Li, Jiashi Li, Faming Wu, Bin Cui 0001

15 Jan 2026 (modified: 25 Jan 2026)CoRR 2024EveryoneRevisionsCC BY-SA 4.0
Loading