Notion: Will re-start this repo from scratch
V1: Transfomer-MachineTranslation
Building Transfomer Machine Translate till Non-Autoregressive
References:
-
Attention is all you need -- https://arxiv.org/abs/1706.03762
-
Scheduled DropHead: A Regularization Method for Transformer Models -- http://arxiv.org/abs/2004.13342