Training with Quantization Noise for Extreme Model CompressionDownload PDFOpen Website

2021 (modified: 14 Nov 2021)ICLR 2021Readers: Everyone
Abstract: We tackle the problem of producing compact models, maximizing their accuracy for a given model size. A standard solution is to train networks with Quantization Aware Training, where the weights are...
0 Replies

Loading