Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning.

Md Rifat Arefin, Gopeshh Subbaraj, Nicolas Gontier, Yann LeCun, Irina Rish, Ravid Shwartz-Ziv, Christopher Pal

15 Oct 2025CoRR 2024EveryoneCC BY-SA 4.0
Loading