FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching

Hongyaoxing Gu, Lijuan Hu, Shuzi Niu, Fangfang Liu

Published: 2026, Last Modified: 06 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading