DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks

Published: 01 Jan 2022, Last Modified: 06 May 2023, ICML 2022, Readers: Everyone
Abstract: Data-parallel distributed training (DDT) has become the de-facto standard for accelerating the training of most deep learning tasks on massively parallel hardware. In the DDT paradigm, the communic...