DFVG: A Heterogeneous Architecture for Speculative Decoding with Draft-on-FPGA and Verify-on-GPU

Shaoqiang Lu, Yangbo Wei, Junhong Qian, Dongge Qin, Shiji Gao, Yizhi Ding, Qifan Wang, Chen Wu, Xiao Shi, Lei He

Published: 2026, Last Modified: 16 Apr 2026ASPLOS (2) 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading