From Offline to Online Memory-Free and Task-Free Continual Learning via Fine-Grained Hypergradients

ICLR 2026 Conference Submission 17004 Authors

19 Sept 2025 (modified: 08 Oct 2025) · ICLR 2026 Conference Submission · CC BY 4.0
Keywords: Online Continual Learning, Gradient Imbalance, Blurry Task Boundaries
Abstract: Continual Learning (CL) aims to learn from a non-stationary data stream where the underlying distribution changes over time. While recent advances have produced efficient memory-free methods in the offline CL (offCL) setting, online CL (onCL) remains dominated by memory-based approaches. The transition from offCL to onCL is challenging, as many offline methods rely on (1) prior knowledge of task boundaries and (2) sophisticated scheduling or optimization schemes, both of which are unavailable when data arrives sequentially and can be seen only once. In this paper, we investigate the adaptation of state-of-the-art memory-free offCL methods to the online setting. We first show that augmenting these methods with lightweight prototypes significantly improves performance, albeit at the cost of increased Gradient Imbalance, which biases learning towards earlier tasks. To address this issue, we introduce Fine-Grained Hypergradients, an online mechanism for rebalancing gradient updates during training. Our experiments demonstrate that the synergy between prototype memory and hypergradient reweighting substantially improves the performance of memory-free methods in onCL. Code will be released upon acceptance.
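The abstract does not spell out the reweighting rule, so the snippet below is only a minimal sketch of one plausible instantiation: per-parameter-group step sizes adapted online with hypergradient descent (in the style of Baydin et al., 2018). The layer-wise grouping, all names, and the hyperparameter values are illustrative assumptions, not the paper's Fine-Grained Hypergradients algorithm.

```python
# Sketch: per-parameter-group hypergradient step-size adaptation.
# "Fine-grained" is interpreted here as one step size per layer; this is an
# assumption for illustration, not the authors' method.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 10))
criterion = nn.CrossEntropyLoss()

# One adaptive step size and one cached gradient per parameter group.
lrs = {name: 1e-2 for name, _ in model.named_parameters()}
prev_grads = {name: torch.zeros_like(p) for name, p in model.named_parameters()}
beta = 1e-4  # hypergradient learning rate (assumed value)

for step in range(100):
    x = torch.randn(16, 32)              # stand-in for one online mini-batch
    y = torch.randint(0, 10, (16,))
    loss = criterion(model(x), y)
    model.zero_grad()
    loss.backward()

    with torch.no_grad():
        for name, p in model.named_parameters():
            g = p.grad
            # The hypergradient of the loss w.r.t. this group's step size is
            # -<g_t, g_{t-1}>: grow the step size when consecutive gradients
            # agree, shrink it when they conflict.
            h = torch.sum(g * prev_grads[name])
            lrs[name] = max(lrs[name] + beta * h.item(), 1e-6)
            prev_grads[name] = g.clone()
            p -= lrs[name] * g
```

In an onCL run, each mini-batch is seen once, so the step sizes are adjusted purely online without task boundaries; how the paper couples this with the prototype memory is not specified in the abstract.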
Supplementary Material: zip
Primary Area: transfer learning, meta learning, and lifelong learning
Submission Number: 17004