Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2025 Workshop ES-FoMo-III Submissions
pLSTM: parallelizable Linear Source Transition Mark networks
Korbinian Pöppel
,
Richard Freinschlag
,
Thomas Schmied
,
Wei Lin
,
Sepp Hochreiter
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Mitigating Over-Smoothing in Mamba2 via Spectral Domain Analysis
Seojin Kim
,
Yehjin Shin
,
Noseong Park
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Efficient Temporal Tokenization for Mobility Prediction with Large Language Models
Haoyu He
,
Haozheng Luo
,
Yan Chen
,
Qi R. Wang
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
Wei Fu
,
Jiaxuan Gao
,
Shusheng Xu
,
Zhiyu Mei
,
Chen Zhu
,
Xujie Shen
,
Chuyi He
,
Guo Wei
,
Jun Mei
,
WANG JIASHU
,
Tongkai Yang
,
Binhang Yuan
,
Yi Wu
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III Spotlight
Readers:
Everyone
AWP: Activation-aware Weight Pruning and Quantization with Projected Gradient Descent
Jing Liu
,
Toshiaki Koike-Akino
,
Ye Wang
,
Hassan Mansour
,
Matthew Brand
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
QuarterMap: Efficient Post-Training Token Pruning for Visual State Space Models
Tien-Yu Chi
,
Hung-Yueh Chiang
,
Diana Marculescu
,
Kai-Chiang Wu
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
Siyan Zhao
,
Devaansh Gupta
,
Qinqing Zheng
,
Aditya Grover
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Thinformer: Guaranteed Attention Approximation via Low-Rank Thinning
Annabelle Michael Carrell
,
Albert Gong
,
Abhishek Shetty
,
Raaz Dwivedi
,
Lester Mackey
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Flexi-LoRA: Efficient LoRA Finetuning with Input-Adaptive Dynamic Ranks
Zongqian Li
,
Yixuan Su
,
Han Zhou
,
Zihao Fu
,
Nigel Collier
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
CoDM: A Co-design Framework for Efficient Sparse Diffusion Models
Xiaolong Wu
,
Xiang Gao
,
Xiyun Song
,
Zongfang Lin
,
Heather Yu
,
David Gu
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Speeding up Speculative Decoding via Sequential Approximate Verification
Meiyu Zhong
,
Noel Teku
,
Ravi Tandon
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection
Shivam Chandhok
,
Qian Yang
,
Oscar Mañas
,
Kanishk Jain
,
Aishwarya Agrawal
,
Leonid Sigal
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching
Guinan Su
,
Li Shen
,
Lu Yin
,
Shiwei Liu
,
Yanwu Yang
,
Jonas Geiping
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Is Visual Prompting the Right Setup for Knowledge Transfer in new Foundation Models?
Niclas Hergenröther
,
Antonio Orvieto
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
MatMuls are Enough for Efficient and Performant Linear-Time Attention
Andrew Argatkiny
,
Ilya Makarov
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Jonas Geiping
,
Sean Michael McLeish
,
Neel Jain
,
John Kirchenbauer
,
Siddharth Singh
,
Brian R. Bartoldson
,
Bhavya Kailkhura
,
Abhinav Bhatele
,
Tom Goldstein
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III Spotlight
Readers:
Everyone
PiKE: Adaptive Data Mixing for Large-Scale Multi-Task Learning Under Low Gradient Conflicts
Zeman Li
,
Yuan Deng
,
Peilin Zhong
,
Meisam Razaviyayn
,
Vahab Mirrokni
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Training Language Models to Reason Efficiently
Daman Arora
,
Andrea Zanette
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
Language System: A Lightweight Ranking Framework for Language Models
Chenheng Zhang
,
Tianqi Du
,
Jizhe Zhang
,
Mingqing Xiao
,
Yifei Wang
,
Yisen Wang
,
Zhouchen Lin
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
FPTQuant: Function-Preserving Transforms for LLM Quantization
Boris van Breugel
,
Yelysei Bondarenko
,
Paul N. Whatmough
,
Markus Nagel
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III Oral
Readers:
Everyone
An Efficient Row-Based Sparse Fine-Tuning with Low Quantization Error
Cen-Jhih Li
,
Aditya Bhaskara
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
VScan: A Two-Stage Visual Token Reduction Framework for Accelerating Large Vision-Language Models
Ce Zhang
,
Kaixin Ma
,
Tianqing Fang
,
Wenhao Yu
,
Hongming Zhang
,
Zhisong Zhang
,
Yaqi Xie
,
Katia P. Sycara
,
Haitao Mi
,
Dong Yu
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
MTraining: Efficient Distributed Training for Ultra-Long Contexts via Dynamic Sparse Attention
Wenxuan Li
,
Chengruidong Zhang
,
Huiqiang Jiang
,
Yucheng Li
,
Yuqing Yang
,
Lili Qiu
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
TMA-Adaptive FP8 Grouped GEMM: Eliminating Padding Requirements in Low-Precision Training and Inference on Hopper
Suzhongling
,
Rong Fu
,
Weihan Cao
,
Jianfei Gao
,
Minxi Jin
,
PeiZhilin
,
Hui Wang
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction
Jang-Hyun Kim
,
Jinuk Kim
,
Sangwoo Kwon
,
Jae W. Lee
,
Sangdoo Yun
,
Hyun Oh Song
Published: 11 Jun 2025, Last Modified: 10 Jul 2025
ES-FoMo III
Readers:
Everyone
«
‹
1
2
3
4
5
6
›
»