Training with Quantization Noise for Extreme Model Compression

Pierre Stock, Angela Fan, Benjamin Graham, Edouard Grave, Rémi Gribonval, Hervé Jégou, Armand Joulin

2021 (modified: 14 Nov 2021)ICLR 2021Readers: Everyone

Abstract: We tackle the problem of producing compact models, maximizing their accuracy for a given model size. A standard solution is to train networks with Quantization Aware Training, where the weights are...

0 Replies