Multi-distribution noise quantisation: an extreme compression scheme for transformer according to parameter distribution
Abstract: With the development of deep learning, neural networks are now widely used across many fields, but the improved model performance comes with a considerable increase in parameters and computation. Mo...