OpenReview.net
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
Ziqing Fan, Siyuan Du, Shengchao Hu, Pingjie Wang, Li Shen, Ya Zhang, Dacheng Tao, Yanfeng Wang
Published: 2025, Last Modified: 07 Dec 2025
ICLR 2025
CC BY-SA 4.0