ReZero is all you need: fast convergence at large depth
Thomas Bachlechner, Bodhisattwa Prasad Majumder, Huanru Henry Mao, Gary Cottrell, Julian J. McAuley
2021 (modified: 06 Apr 2022)
UAI 2021
Readers: Everyone
Abstract: Deep networks often suffer from vanishing or exploding gradients due to inefficient signal propagation, leading to long training times or convergence difficulties. Various architecture designs, sop...