EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Unified Compression and Adaptive Layer Voting

Zhongzhi Yu, Zheng Wang, Yuhan Li, Ruijie Gao, Xiaoya Zhou, Sreenidhi Reddy Bommu, Yang (Katie) Zhao, Yingyan (Celine) Lin

Published: 23 Jun 2024, Last Modified: 01 Mar 2026 | Crossref | CC BY-SA 4.0