An Empirical Study of Example Forgetting during Deep Neural Network Learning

Mariya Toneva*; Alessandro Sordoni*; Remi Tachet des Combes*; Adam Trischler; Yoshua Bengio; Geoffrey J. Gordon

An Empirical Study of Example Forgetting during Deep Neural Network Learning

Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes*, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon

Published: 21 Dec 2018, Last Modified: 22 Jun 2025ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not undergo a clear distributional shift. We define a ``forgetting event'' to have occurred when an individual training example transitions from being classified correctly to incorrectly over the course of learning. Across several benchmark data sets, we find that: (i) certain examples are forgotten with high frequency, and some not at all; (ii) a data set's (un)forgettable examples generalize across neural architectures; and (iii) based on forgetting dynamics, a significant fraction of examples can be omitted from the training data set while still maintaining state-of-the-art generalization performance.

Keywords: catastrophic forgetting, sample weighting, deep generalization

TL;DR: We show that catastrophic forgetting occurs within what is considered to be a single task and find that examples that are not prone to forgetting can be removed from the training set without loss of generalization.

Code: [![github](/images/github_icon.svg) mtoneva/example_forgetting](https://github.com/mtoneva/example_forgetting) + [![Papers with Code](/images/pwc_icon.svg) 2 community implementations](https://paperswithcode.com/paper/?openreview=BJlxm30cKm)

Data: [CIFAR-10](https://paperswithcode.com/dataset/cifar-10)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 3 code implementations](https://www.catalyzex.com/paper/an-empirical-study-of-example-forgetting/code)

14 Replies

Loading

An Empirical Study of Example Forgetting during Deep Neural Network Learning

Mariya Toneva*, Alessandro Sordoni*, Remi Tachet des Combes*, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon

Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes*, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon