A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Published: 2025, Last Modified: 07 Oct 2025ICLR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading