S4C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models.

Tao He, Guang Huang, Yu Yang, Tianshi Xu, Sicheng Zhao, Guiguang Ding, Pengyang Wang, Feng Tian

16 Jan 2026CoRR 2025EveryoneCC BY-SA 4.0
Loading