Shake-Shake regularization of 3-branch residual networks

May 10, 2021 (edited Mar 15, 2017); ICLR 2017 workshop submission; Readers: Everyone
  • TL;DR: Reduce overfitting by replacing, in a 3-branch ResNet, the standard summation of residual branches with a stochastic affine combination
  • Abstract: The method introduced in this paper aims to help computer vision practitioners faced with an overfitting problem. The idea is to replace, in a 3-branch ResNet, the standard summation of residual branches with a stochastic affine combination. The largest tested model improves on the best single-shot published result on CIFAR-10 by reaching 2.86% test error. Code is available at https://github.com/xgastaldi/shake-shake
  • Keywords: Computer vision, Deep learning, Supervised Learning
  • Conflicts: n/a
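
The stochastic affine combination described in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the linked repository. The function name `shake_combine`, the uniform sampling of the coefficient on [0, 1], and the fixed coefficient of 0.5 at evaluation time are assumptions made for illustration; the abstract itself only states that the plain sum of the residual branches is replaced by a random affine combination.

```python
import numpy as np


def shake_combine(x, branch1, branch2, rng=None, training=True):
    """Combine a skip connection with two residual-branch outputs.

    Hypothetical helper, not the authors' code.
    A standard 3-branch block would return x + branch1 + branch2.
    Here the two residual branches are mixed with weights that sum to 1.
    """
    if training:
        # Assumption: alpha is drawn uniformly from [0, 1) on every call.
        rng = np.random.default_rng() if rng is None else rng
        alpha = rng.uniform(0.0, 1.0)
    else:
        # Assumption: at evaluation time use the expected value of alpha.
        alpha = 0.5
    return x + alpha * branch1 + (1.0 - alpha) * branch2


# Usage: the arrays stand in for the outputs of two residual branches.
x = np.zeros(4)
b1 = np.ones(4)
b2 = 3.0 * np.ones(4)
train_out = shake_combine(x, b1, b2, rng=np.random.default_rng(0))
eval_out = shake_combine(x, b1, b2, training=False)
```

Because the two weights sum to one, the training output always lies between the two branch outputs (plus the skip connection), and two identical branches give the same result whatever coefficient is drawn.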
