Published: 01 Jan 2022, Last Modified: 06 May 2023ICML 2022Readers: Everyone
Abstract:Data-parallel distributed training (DDT) has become the de-facto standard for accelerating the training of most deep learning tasks on massively parallel hardware. In the DDT paradigm, the communic...