Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

Published: 12 Jan 2021, Last Modified: 05 May 2023 | ICLR 2021 Poster
Keywords: Catastrophic forgetting, continual learning, representation analysis, representation learning
Abstract: Catastrophic forgetting is a recurring challenge in the development of versatile deep learning models. Despite its ubiquity, there is limited understanding of its connection to neural network (hidden) representations and task semantics. In this paper, we address this knowledge gap. Through quantitative analysis of neural representations, we find that deeper layers are disproportionately responsible for forgetting, with sequential training erasing the representational subspaces of earlier tasks. Methods that mitigate forgetting stabilize these deeper layers but differ in their precise effects: some increase feature reuse, while others store task representations orthogonally, preventing interference. These insights also yield an analytic argument and an empirical picture relating forgetting to semantic similarity between tasks, where we find that maximal forgetting occurs for task sequences of intermediate similarity.
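As a concrete illustration of the kind of quantitative representation analysis the abstract describes, the sketch below computes layerwise similarity between a network's activations on task-A inputs before and after training on task B, using linear CKA (centered kernel alignment, Kornblith et al., 2019) as a standard similarity measure. This is a minimal sketch, not the authors' released code; the activation dictionaries and the helper `layerwise_forgetting_profile` are illustrative assumptions.

```python
# Minimal sketch of a layerwise representation-similarity probe using linear
# CKA. Not the authors' released code; the activation dictionaries passed in
# are assumed inputs for illustration.
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between activation matrices of shape (n_examples, n_features)."""
    X = X - X.mean(axis=0, keepdims=True)  # center each feature
    Y = Y - Y.mean(axis=0, keepdims=True)
    # ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    cross = np.linalg.norm(Y.T @ X, "fro") ** 2
    return cross / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro"))

def layerwise_forgetting_profile(acts_before: dict, acts_after: dict) -> dict:
    """Compare each layer's task-A activations before vs. after training on task B.

    acts_before, acts_after: dicts mapping layer name -> (n, d) activation
    array, both computed on the same task-A inputs.
    """
    return {layer: linear_cka(acts_before[layer], acts_after[layer])
            for layer in acts_before}
```

Under this probe, CKA near 1 in early layers and markedly lower values in late layers would match the paper's qualitative finding that forgetting is concentrated in deeper layers.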
One-sentence Summary: We study the layerwise changes in representations caused by catastrophic forgetting, and use this understanding to examine how task similarity influences forgetting.
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
Supplementary Material: zip