Nonconvex Continual Learning with Episodic Memory

28 Sept 2020 (modified: 05 May 2023) · ICLR 2021 Conference Blind Submission
Keywords: continual learning, nonconvex optimization
Abstract: Continual learning aims to prevent catastrophic forgetting while learning a new task without access to the data of previously learned tasks. In such learning scenarios, the memory stores a small subset of the data from previous tasks and is used in various ways, such as quadratic programming and sample selection. Current memory-based continual learning algorithms are formulated as constrained optimization problems and recast the constraints as a gradient-based approach. However, previous works have not provided a theoretical proof of convergence on previously learned tasks. In this paper, we propose a theoretical convergence analysis of continual learning based on the stochastic gradient descent method. Our method, nonconvex continual learning (NCCL), can achieve the same convergence rate as standard SGD when the proposed catastrophic forgetting term is suppressed at each iteration. We also show that memory-based approaches have an inherent problem of overfitting to the memory, which degrades performance on previously learned tasks, namely catastrophic forgetting. We empirically demonstrate that NCCL successfully performs continual learning with episodic memory by scaling learning rates adaptively to mini-batches on several image classification tasks.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
One-sentence Summary: We provide a theoretical convergence analysis of continual learning and propose a simple continual learning algorithm based on gradient scaling.
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=BPbv9xLO3W
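To make the idea in the abstract concrete, below is a minimal PyTorch-style sketch of what an episodic-memory update with per-mini-batch gradient scaling could look like. It is an illustration under assumptions only: the function ncl_step, the particular scaling rule, and all other names are hypothetical and do not reproduce the authors' NCCL implementation.

```python
# Minimal sketch of episodic-memory replay with a gradient-scaling update.
# Hypothetical illustration; not the authors' released NCCL code.
import torch


def ncl_step(model, loss_fn, optimizer, current_batch, memory_batch, eps=1e-8):
    """One update that damps the current-task gradient when it conflicts with
    the gradient computed on the episodic-memory mini-batch."""
    x_cur, y_cur = current_batch
    x_mem, y_mem = memory_batch

    # Gradient on the memory mini-batch (data from previously learned tasks).
    optimizer.zero_grad()
    loss_fn(model(x_mem), y_mem).backward()
    g_mem = torch.cat([p.grad.flatten() for p in model.parameters()])

    # Gradient on the current-task mini-batch.
    optimizer.zero_grad()
    loss_fn(model(x_cur), y_cur).backward()
    g_cur = torch.cat([p.grad.flatten() for p in model.parameters()])

    # If the two gradients conflict (negative inner product), shrink the
    # current-task step so the interference (forgetting) term stays small;
    # otherwise keep the full step.
    inner = torch.dot(g_cur, g_mem)
    scale = 1.0
    if inner < 0:
        scale = float(torch.clamp(1.0 + inner / (g_mem.norm() ** 2 + eps), min=0.0))

    # Write the combined, scaled gradient back and take an optimizer step.
    combined = scale * g_cur + g_mem
    offset = 0
    for p in model.parameters():
        n = p.numel()
        p.grad.copy_(combined[offset:offset + n].view_as(p))
        offset += n
    optimizer.step()
```

In practice the memory mini-batch would be sampled from a small replay buffer populated while earlier tasks were trained; the rule above is just one simple instance of adapting the step taken on each mini-batch.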