AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving

Published: 2023, Last Modified: 27 Jan 2026, Venue: OSDI 2023, License: CC BY-SA 4.0