Up or Down? Adaptive Rounding for Post-Training QuantizationDownload PDFOpen Website

2020 (modified: 30 Sept 2024)ICML 2020Readers: Everyone
Abstract: When quantizing neural networks, assigning each floating-point weight to its nearest fixed-point value is the predominant approach. We find that, perhaps surprisingly, this is not the best we can d...
0 Replies

Loading