Revisiting Replay and Gradient Alignment For Continual Pretraining of Large Language Models

Published: 22 Sept 2025, Last Modified: 03 Jan 2026WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: continual learning, continual pretraining, large language models
Submission Number: 330
Loading