Toggle navigation
OpenReview
.net
Login
×
Go to
CORR 2023
homepage
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
Max Ryabinin
,
Tim Dettmers
,
Michael Diskin
,
Alexander Borzunov
2023 (modified: 14 Apr 2023)
CoRR 2023
Readers:
Everyone
0 Replies
Loading