Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems

20 Apr 2024 (modified: 12 Feb 2016) · ICLR 2016 workshop submission
Abstract: We propose a simplified model of attention which is applicable to feed-forward neural networks and demonstrate that the resulting model can solve the synthetic "addition" and "multiplication" long-term memory problems for sequence lengths which are both longer and more widely varying than the best published results for these tasks.
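The abstract describes attention adapted to feed-forward networks: each timestep's feature vector is scored by a small learnable function, the scores are normalized with a softmax over time, and the sequence is collapsed into a single weighted-average context vector. The sketch below illustrates that general scheme; the specific scoring function `a(h_t) = tanh(h_t @ w + b)` and the variable names are illustrative assumptions, not the paper's exact parameterization.

```python
import numpy as np

def feedforward_attention(h, w, b):
    """Collapse a sequence of hidden states into one context vector.

    h: (T, D) array of per-timestep feature vectors
    w: (D,) weights and b: scalar bias of an assumed scoring
       function a(h_t) = tanh(h_t @ w + b)

    Returns a (D,) context vector: a weighted average of the rows
    of h, with weights given by a softmax over the scores, so the
    whole sequence can be fed to an ordinary feed-forward network.
    """
    e = np.tanh(h @ w + b)        # (T,) unnormalized scores
    alpha = np.exp(e - e.max())   # softmax over time steps,
    alpha /= alpha.sum()          # shifted for numerical stability
    return alpha @ h              # (D,) weighted average

rng = np.random.default_rng(0)
h = rng.normal(size=(5, 4))       # toy sequence: T=5 steps, D=4 dims
c = feedforward_attention(h, rng.normal(size=4), 0.0)
```

Because the softmax weights are nonnegative and sum to one, the context vector is a convex combination of the timestep vectors, which is what lets a fixed-size feed-forward network consume variable-length sequences.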
Conflicts: columbia.edu, google.com