Enhancing Attention with Explicit Phrasal Alignments

Xuan-Phi Nguyen; Shafiq Joty; Thanh-Tung Nguyen

Enhancing Attention with Explicit Phrasal Alignments

Xuan-Phi Nguyen, Shafiq Joty, Thanh-Tung Nguyen

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Blind SubmissionReaders: Everyone

Abstract: The attention mechanism is an indispensable component of any state-of-the-art neural machine translation system. However, existing attention methods are often token-based and ignore the importance of phrasal alignments, which are the backbone of phrase-based statistical machine translation. We propose a novel phrase-based attention method to model n-grams of tokens as the basic attention entities, and design multi-headed phrasal attentions within the Transformer architecture to perform token-to-token and token-to-phrase mappings. Our approach yields improvements in English-German, English-Russian and English-French translation tasks on the standard WMT'14 test set. Furthermore, our phrasal attention method shows improvements on the one-billion-word language modeling benchmark.

Keywords: NMT, Phrasal Attention, Machine Translation, Language Modeling

Original Pdf: pdf

4 Replies

Loading