Stabilized activation scale estimation for precise Post-Training Quantization

Published: 2024, Last Modified: 10 Nov 2025Neurocomputing 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•For the first time, introducing the EMA mechanism for stabilized activation scale updating.•Introducing more dissimilar activation maps into weight reconstruction optimization for better PTQ accuracy.•Achieving remarkable improvements in several different bit quantization especially W2A4.
Loading