Performance Analysis of GPU-Based Convolutional Neural Networks

Published: 2016, Last Modified: 07 May 2025ICPP 2016EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: As one of the most important deep learning models, convolutional neural networks (CNNs) have achieved great successes in a number of applications such as image classification, speech recognition and nature language understanding. Training CNNs on large data sets is computationally expensive, leading to a flurry of research and development of open-source parallel implementations on GPUs. However, few studies have been performed to evaluate the performance characteristics of those implementations. In this paper, we conduct a comprehensive comparison of these implementations over a wide range of parameter configurations, investigate potential performance bottlenecks and point out a number of opportunities for further optimization.
Loading