LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse ApproximationDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 29 Sept 2023ICML 2023Readers: Everyone
Abstract: Transformer models have achieved remarkable results in various natural language tasks, but they are often prohibitively large, requiring massive memories and computational resources. To re- duce th...
0 Replies

Loading