CSQ: Centered Symmetric Quantization for Extremely Low Bit Neural Networks

Faaiz Asim; Jaewoo Park; Azat Azamat; Jongeun Lee

CSQ: Centered Symmetric Quantization for Extremely Low Bit Neural Networks

Faaiz Asim, Jaewoo Park, Azat Azamat, Jongeun Lee

Published: 28 Jan 2022, Last Modified: 13 Feb 2023ICLR 2022 SubmittedReaders: Everyone

Keywords: deep learning, classification, low precision, uniform symmetric quantization, binary neural network hardware

Abstract: Recent advances in quantized neural networks (QNNs) are closing the performance gap with the full precision neural networks. However at very low precision (i.e., $\le 3$-bits), QNNs often still suffer significant performance degradation. The conventional uniform symmetric quantization scheme allocates unequal numbers of positive and negative quantization levels. We show that this asymmetry in the number of positive and negative quantization levels can result in significant quantization error and performance degradation at low precision. We propose and analyze a quantizer called centered symmetric quantizer (CSQ), which preserves the symmetry of latent distribution by providing equal representations to the negative and positive sides of the distribution. We also propose a novel method to efficiently map CSQ to binarized neural network hardware using bitwise operations. Our analyses and experimental results using state-of-the-art quantization methods on ImageNet and CIFAR-10 show the importance of using CSQ for weight in place of the conventional quantization scheme at extremely low-bit precision (2$\sim$3 bits).

One-sentence Summary: A simple trick for extremely low-bit quantization with an in-depth analysis and an efficient bit-parallel realization method

14 Replies

Loading