Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow
Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak
Published: 01 Jan 2025, Last Modified: 15 May 2025
ASPLOS (1) 2025
CC BY-SA 4.0