Exploring and enhancing the transfer of distribution in knowledge distillation for autoregressive language models

Jun Rao, Xuebo Liu, Zepeng Lin, Liang Ding, Jing Li, Min Zhang

Published: 2026, Last Modified: 22 Mar 2026
Venue: Knowl. Based Syst. 2026
License: CC BY-SA 4.0