2017 (modified: 11 Nov 2022)ICML 2017Readers: Everyone
Abstract:It is well known that it is challenging to train deep neural networks and recurrent neural networks for tasks that exhibit long term dependencies. The vanishing or exploding gradient problem is a w...