Continual Learning via Explicit Structure Learning

Xilai Li; Yingbo Zhou; Tianfu Wu; Richard Socher; Caiming Xiong

27 Sept 2018 (modified: 05 May 2023) | ICLR 2019 Conference Blind Submission | Readers: Everyone

Abstract: Despite recent advances in deep learning, neural networks suffer from catastrophic forgetting when tasks are learned sequentially. We propose a conceptually simple and general framework for continual learning in which structure optimization is considered explicitly during learning. We implement this idea by separating structure learning from parameter learning. During structure learning, the model searches for the best structure for the current task: it learns when to reuse or modify structure from previous tasks and when to create new structure. The model parameters are then estimated with the optimal structure. Empirically, we find that our approach leads to sensible structures when learning multiple tasks continually, and that explicit structure learning largely alleviates catastrophic forgetting. Our method also outperforms all other baselines on the permuted MNIST and split CIFAR datasets in the continual learning setting.
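
The two-stage idea in the abstract (first choose a structure for the current task, then estimate parameters with it) can be illustrated with a toy sketch. This is not the authors' implementation. The option names `reuse`, `adapt`, and `new`, the `LayerCandidate` record, the validation losses, the parameter penalty, and the rule that reused layers stay frozen are all assumptions made here for illustration; the abstract does not specify how the structure is searched.

```python
from dataclasses import dataclass

# Hypothetical per-layer options, loosely mirroring "reuse", "modify", "create new".
CHOICES = ("reuse", "adapt", "new")


@dataclass
class LayerCandidate:
    val_loss: float    # assumed validation loss on the current task for this option
    extra_params: int  # assumed number of parameters this option would add


def select_structure(candidates, param_penalty=1e-6):
    """Stage 1 (toy): per layer, pick the option with the lowest penalized loss."""
    structure = []
    for options in candidates:
        best = min(
            CHOICES,
            key=lambda c: options[c].val_loss + param_penalty * options[c].extra_params,
        )
        structure.append(best)
    return structure


def trainable_layers(structure):
    """Stage 2 (toy): assume only adapted or new layers are trained; reused ones stay frozen."""
    return [i for i, choice in enumerate(structure) if choice != "reuse"]


# Made-up numbers for a three-layer model facing a new task.
candidates = [
    {"reuse": LayerCandidate(0.30, 0), "adapt": LayerCandidate(0.30, 1_000), "new": LayerCandidate(0.29, 50_000)},
    {"reuse": LayerCandidate(0.50, 0), "adapt": LayerCandidate(0.35, 1_000), "new": LayerCandidate(0.34, 50_000)},
    {"reuse": LayerCandidate(0.90, 0), "adapt": LayerCandidate(0.70, 1_000), "new": LayerCandidate(0.40, 50_000)},
]

structure = select_structure(candidates)
print(structure)                    # ['reuse', 'adapt', 'new']
print(trainable_layers(structure))  # [1, 2]
```

In this sketch, freezing the reused layers is what keeps earlier tasks unaffected by training on the new one; the penalty term simply discourages adding parameters when reuse is nearly as good.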

Keywords: continuous learning, catastrophic forgetting, architecture learning
