Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Accelerating Iterative Retrieval-augmented Language Model Serving with Speculation
Zhihao Zhang
,
Alan Zhu
,
Lijie Yang
,
Yihua Xu
,
Lanting Li
,
Phitchaya Mangpo Phothilimthana
,
Zhihao Jia
Published: 01 Jan 2024, Last Modified: 16 May 2025
ICML 2024
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading