ICLR 2025 Workshop SCOPE Submissions
UniForm: A Reuse Attention Mechanism for Efficient Transformers on Resource-Constrained Edge Devices
Seul-Ki Yeom, Tae-Ho Kim
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone
KV Prediction for Improved Time to First Token
Maxwell Horton, Qingqing Cao, Chenfan Sun, Yanzi Jin, Sachin Mehta, Mohammad Rastegari, Moin Nabi
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone
Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing
Aviv Bick, Tobias Katsch, Nimit Sharad Sohoni, Arjun D Desai, Albert Gu
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone
On Vanishing Variance in Transformer Length Generalization
Ruining Li, Gabrijel Boduljak, Jensen Zhou
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone
Attention Is All You Need For Mixture-of-Depths Routing
Advait Gadhikar, Souptik Kumar Majumdar, Niclas Popp, Piyapat Saranrittichai, Martin Rapp, Lukas Schott
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone
Context Is All You Need: Efficient Retrieval Augmented Generation for Domain Specific AI
Peixi Xiong, Chaunte W. Lacewell, Sameh Gobriel, Nilesh Jain
Published: 05 Mar 2025, Last Modified: 14 Apr 2025
SCOPE - ICLR 2025 Poster
Readers: Everyone