Pruning Depthwise Separable Convolutions for Extra Efficiency Gain of Lightweight Models

Cheng-Hao Tu; Jia-Hong Lee; Yi-Ming Chan; Chu-Song Chen

Pruning Depthwise Separable Convolutions for Extra Efficiency Gain of Lightweight Models

Cheng-Hao Tu, Jia-Hong Lee, Yi-Ming Chan, Chu-Song Chen

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Withdrawn SubmissionReaders: Everyone

Abstract: Deep convolutional neural networks are good at accuracy while bad at efficiency. To improve the inference speed, two kinds of directions are developed, lightweight model designing and network weight pruning. Lightweight models have been proposed to improve the speed with good enough accuracy. It is, however, not trivial if we can further speed up these “compact” models by weight pruning. In this paper, we present a technique to gradually prune the depthwise separable convolution networks, such as MobileNet, for improving the speed of this kind of “dense” network. When pruning depthwise separable convolutions, we need to consider more structural constraints to ensure the speedup of inference. Instead of pruning the model with the desired ratio in one stage, the proposed multi-stage gradual pruning approach can stably prune the filters with a finer pruning ratio. Our method achieves 1.68 times speedup with neglectable accuracy drop for MobileNetV2.

Keywords: Deep Learning, Network Pruning, Lightweight CNN

Original Pdf: pdf

6 Replies

Loading