OpenReview.net
TinyServe: Query-Aware Cache Selection for Efficient LLM Serving
Dong Liu, Yanxuan Yu
Published: 27 Oct 2025, Last Modified: 15 Jan 2026
Crossref
CC BY-SA 4.0
External IDs: doi:10.1145/3746027.3758181