PipeMLLM: Accelerating on-device Multimodal LLM Inference via Speculative Sensing and Encoding

Runxi Huang, Xiaomin Ouyang

Published: 04 Nov 2025, Last Modified: 07 Mar 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading