Scaling Deep Learning workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing

Published: 01 Jan 2020, Last Modified: 05 Mar 2025Future Gener. Comput. Syst. 2020EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We have provided a detailed performance and power scaling analysis of important CNN workloads on two architectures: (a) NVIDIA DGX-1 (eight Pascal P100 GPUs interconnected with NVLink) and (b) a cluster with Intel Knights Landing (KNL) CPUs interconnected with Intel Omni-Path.•For ML workloads considered here, GPUs provide the highest overall raw performance. We also find that a single KNL can be competitive with a single Pascal in certain cases. Focusing DL architectural innovation on FLOPs can be misguided.•The importance of the interconnect is highly dependent on neural network architecture.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview