MBQuant: A novel multi-branch topology method for arbitrary bit-width network quantization

Published: 01 Jan 2025, Last Modified: 10 Jan 2025, Pattern Recognit. 2025, CC BY-SA 4.0
Abstract: Highlights
• We find that existing arbitrary bit-width methods suffer from accumulated quantization errors arising from switching weight and activation bit-widths.
• We introduce MBQuant, which uses a multi-branch topology and an amortization strategy to address accumulated quantization errors.
• Extensive experiments demonstrate that MBQuant achieves significant performance gains compared to existing arbitrary bit-width methods.