Pre-Warming is Not Enough: Accelerating Serverless Inference With Opportunistic Pre-Loading

Yifan Sui, Hanfei Yu, Yitao Hu, Jianxun Li, Hao Wang

Published: 20 Nov 2024, Last Modified: 07 Jan 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading