Universal-KD: Attention-based Output-Grounded Intermediate Layer Knowledge Distillation

Published: 2021, Last Modified: 11 Jun 2023 | EMNLP (1) 2021 | Readers: Everyone