I-BERT: Integer-only BERT Quantization

Published: 2021, Last Modified: 17 May 2023
ICML 2021
Readers: Everyone
Abstract: Transformer-based models, like BERT and RoBERTa, have achieved state-of-the-art results in many Natural Language Processing tasks. However, their memory footprint, inference latency, and power cons...