2021 (modified: 14 Nov 2021)ICLR 2021Readers: Everyone
Abstract:We tackle the problem of producing compact models, maximizing their accuracy for a given model size. A standard solution is to train networks with Quantization Aware Training, where the weights are...