Can Transformers Learn n-gram Language Models?

Anej Svete, Nadav Borenstein, Mike Zhou, Isabelle Augenstein, Ryan Cotterell

27 Jan 2026CoRR 2024EveryoneCC BY-SA 4.0
Loading