MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, et al. (12 additional authors not shown)
Published: 01 Jan 2024, Last Modified: 01 Oct 2024
NSDI 2024
CC BY-SA 4.0