Discriminative Representation Loss (DRL): A More Efficient Approach than Gradient Re-Projection in Continual Learning

28 Sept 2020 (modified: 05 May 2023) · ICLR 2021 Conference Blind Submission · Readers: Everyone
Keywords: continual learning, episodic memory, GEM, experience replay, deep metric learning
Abstract: The use of episodic memories in continual learning has been shown to be effective in alleviating catastrophic forgetting. In recent studies, several gradient-based approaches have been developed to make more efficient use of compact episodic memories: they constrain the gradients resulting from new samples with those from memorized samples, aiming to reduce the diversity of gradients across tasks. In this paper, we reveal the relation between the diversity of gradients and the discriminativeness of representations, demonstrating connections between Deep Metric Learning and continual learning. Based on these findings, we propose a simple yet efficient method -- Discriminative Representation Loss (DRL) -- for continual learning. Compared with several state-of-the-art methods, DRL proves effective at low computational cost on multiple benchmarks in the setting of online continual learning.
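To make the gradient-re-projection baselines referenced in the abstract concrete, the following is a minimal sketch of the A-GEM-style constraint: when the gradient on new samples conflicts with the gradient on memorized samples (negative dot product), it is projected onto the plane orthogonal to the memory gradient so the loss on memory does not increase. This illustrates the approach DRL is compared against, not DRL itself; the function name and vectors are illustrative.

```python
import numpy as np

def project_gradient(g_new: np.ndarray, g_mem: np.ndarray) -> np.ndarray:
    """A-GEM-style projection of the new-task gradient.

    If g_new conflicts with the memory gradient g_mem (negative dot
    product), project g_new onto the plane orthogonal to g_mem so that
    following it does not increase the loss on memorized samples.
    """
    dot = g_new @ g_mem
    if dot >= 0:
        return g_new  # no conflict: use the gradient as-is
    # Remove the component of g_new that points against g_mem.
    return g_new - (dot / (g_mem @ g_mem)) * g_mem

# Toy example: a conflicting gradient is projected to be orthogonal.
g_mem = np.array([1.0, 0.0])
g_new = np.array([-1.0, 1.0])   # negative dot product with g_mem
g_proj = project_gradient(g_new, g_mem)
print(g_proj)            # component along g_mem removed
print(g_proj @ g_mem)    # zero: no longer conflicts with memory
```

DRL's claimed advantage over such methods is that it shapes representations through a loss term rather than re-projecting gradients at every step, avoiding that per-step overhead.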
One-sentence Summary: We reveal the relation between the diversity of gradients and the discriminativeness of representations, show connections between deep metric learning and continual learning, and based on these findings propose a simple yet efficient method for continual learning.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=b-HAkxG8z