## 1. Overview

The English-German machine translation workload trains a Tramsfomer model on WMT
2017 Machine Translation en-de dataset. We get bleu value 25.03 (evaluated on
WMT 2014) after 23634 steps or 17652.04 seconds on a 8 V-100 GPU machine (1.34
step/second).

## 2. Reference

https://github.com/google/flax/tree/main/examples/wmt

