TREC: Transient Redundancy Elimination-based Convolution

Jiawei Guan; Feng Zhang; Jiesong Liu; Hsin-Hsuan Sung; Ruofan Wu; Xiaoyong Du; Xipeng Shen

TREC: Transient Redundancy Elimination-based Convolution

Jiawei Guan, Feng Zhang, Jiesong Liu, Hsin-Hsuan Sung, Ruofan Wu, Xiaoyong Du, Xipeng Shen

Published: 31 Oct 2022, Last Modified: 12 Jan 2023NeurIPS 2022 AcceptReaders: Everyone

Keywords: Transient redundancy, convolution acceleration, back-propagation

Abstract: The intensive computations in convolutional neural networks (CNNs) pose challenges for resource-constrained devices; eliminating redundant computations from convolution is essential. This paper gives a principled method to detect and avoid transient redundancy, a type of redundancy existing in input data or activation maps and hence changing across inferences. By introducing a new form of convolution (TREC), this new method makes transient redundancy detection and avoidance an inherent part of the CNN architecture, and the determination of the best configurations for redundancy elimination part of CNN backward propagation. We provide a rigorous proof of the robustness and convergence of TREC-equipped CNNs. TREC removes over 96% computations and achieves 3.51x average speedups on microcontrollers with minimal (about 0.7%) accuracy loss.

TL;DR: This paper gives a principled method to detect and avoid transient redundancy of CNNs.

Supplementary Material: pdf

10 Replies

Loading