OpenReview.net
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Chao Zeng, Songwei Liu, Shu Yang, Fangmin Chen, Xing Mei, Lean Fu
Published: 2024, Last Modified: 29 Mar 2026
CoRR 2024
CC BY-SA 4.0
External IDs: dblp:journals/corr/abs-2412-17560