Optimized Multi-Token Joint Decoding With Auxiliary Model for LLM Inference

Published: 2025, Last Modified: 01 May 2026ICLR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading