MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Yuhang Li; Mingzhu Shen; Jian Ma; Yan Ren; Mingxin Zhao; Qi Zhang; Ruihao Gong; Fengwei Yu; Junjie Yan

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Yuhang Li, Mingzhu Shen, Jian Ma, Yan Ren, Mingxin Zhao, Qi Zhang, Ruihao Gong, Fengwei Yu, Junjie Yan

Published: 29 Jul 2021, Last Modified: 26 May 2025NeurIPS 2021 Datasets and Benchmarks Track (Round 1)Readers: Everyone

Keywords: Quantization-aware Training, Post-training Quantization, Benchmark

Abstract: Model quantization has emerged as an indispensable technique to accelerate deep learning inference. Although researchers continue to push the frontier of quantization algorithms, existing quantization work is often unreproducible and undeployable. This is because researchers do not choose consistent training pipelines and ignore the requirements for hardware deployments. In this work, we propose Model Quantization Benchmark (MQBench), a first attempt to evaluate, analyze, and benchmark the reproducibility and deployability for model quantization algorithms. We choose multiple different platforms for real-world deployments, including CPU, GPU, ASIC, DSP, and evaluate extensive state-of-the-art quantization algorithms under a unified training pipeline. MQBench acts like a bridge to connect the algorithm and the hardware. We conduct a comprehensive analysis and find considerable intuitive or counter-intuitive insights. By aligning up the training settings, we find existing algorithms have about-the-same performance on the conventional academic track. While for the hardware-deployable quantization, there is a huge accuracy gap and still a long way to go. Surprisingly, no existing algorithm wins every challenge in MQBench, and we hope this work could inspire future research directions.

Supplementary Material: zip

TL;DR: We design a benchmark for quantization algorithms and target hardware.

URL: http://mqbench.tech/

Contribution Process Agreement: Yes

Author Statement: Yes

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/mqbench-towards-reproducible-and-deployable/code)

9 Replies

Loading