Local-Cloud Inference Offloading for LLMs in Multi-Modal, Multi-Task, Multi-Dialogue Settings

Liangqi Yuan, Dong-Jun Han, Shiqiang Wang, Christopher Brinton

Published: 2025, Last Modified: 24 Mar 2026MobiHoc 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading