Transformer Code: https://github.com/tensorflow/tensor2tensor Paper: Attention Is All You Need.