LS-PRISM: A layer-selective pruning method via low-rank approximation and sparsification for efficient large language model compression

Renshuai Tao, Hairong Chen, Yuzhe Guo, Jiakai Wang, Boying Wang, Rongrong Ni, Yao Zhao

Published: 01 Dec 2025, Last Modified: 12 Mar 2026Neural NetworksEveryoneRevisionsCC BY-SA 4.0
Loading