Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization.

Deyuan Liu, Zecheng Wang, Bingning Wang, Weipeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Dianbo Sui

21 Jan 2026 (modified: 21 Jan 2026), ICML 2025, CC BY-SA 4.0