
NMT Learning Curve

Re-iteration of the famous learning curve experiment from Koehn and Knowles (2017)

Figure 1. NMT learning curve (Koehn and Knowles, 2017)

Aim

Neural / deep learning models are known to perform poorly when training data is limited, as demonstrated in Figure 1. In this work, newer neural MT approaches such as the Transformer are compared with non-neural and earlier neural predecessors to see how much improvement has been made.

Setup

Train NMT models at different training corpus sizes and track their performance on a test set (BLEU). Use the same data sets and splits as Koehn and Knowles (2017), and compare the results with theirs.
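
A minimal sketch of this learning-curve loop is shown below. It assumes a parallel corpus in `train.src` / `train.tgt` and a fixed test set; the `train_and_translate()` helper is a hypothetical placeholder for whatever NMT training/decoding pipeline is used, and the corpus sizes are illustrative, not the exact points from the experiment.

```python
import random
import sacrebleu

def subsample(src_path, tgt_path, n, seed=42):
    """Draw a random parallel subsample of n sentence pairs."""
    with open(src_path) as fs, open(tgt_path) as ft:
        pairs = list(zip(fs, ft))
    random.Random(seed).shuffle(pairs)
    return pairs[:n]

def train_and_translate(train_pairs, test_src):
    """Placeholder: train an NMT model on train_pairs and translate test_src."""
    raise NotImplementedError("plug in your Transformer training/decoding here")

sizes = [100_000, 200_000, 400_000, 800_000, 1_600_000]  # example corpus sizes
with open("test.src") as f:
    test_src = [line.strip() for line in f]
with open("test.ref") as f:
    test_ref = [line.strip() for line in f]

for n in sizes:
    train_pairs = subsample("train.src", "train.tgt", n)
    hyps = train_and_translate(train_pairs, test_src)
    bleu = sacrebleu.corpus_bleu(hyps, [test_ref])  # BLEU on the same test set for every size
    print(f"{n}\t{bleu.score:.2f}")
```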

Figure 2. NMT learning curve revisited

Summary:

  • Transformer NMT requires less training data than the RNN NMT used by Koehn and Knowles (2017). See Transformer base in Figure 2.

  • The Transformer base is already consistently better than the prior neural model, and it can be improved further by tuning a few hyperparameters such as batch size and vocabulary size (Transformer varbatch in Figure 2); a sketch of one such adjustment follows this list.
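
For illustration only, one way to adapt the subword vocabulary size to the training corpus size using SentencePiece is sketched below; the specific vocabulary sizes and thresholds are assumptions, not the values used in this repository's experiments.

```python
import sentencepiece as spm

def pick_vocab_size(num_sentences: int) -> int:
    """Smaller corpora get smaller subword vocabularies (assumed heuristic)."""
    if num_sentences < 100_000:
        return 8_000
    if num_sentences < 1_000_000:
        return 16_000
    return 32_000

# Count training sentences and train a BPE model sized accordingly.
num_sentences = sum(1 for _ in open("train.src"))
spm.SentencePieceTrainer.train(
    input="train.src,train.tgt",
    model_prefix="bpe",
    vocab_size=pick_vocab_size(num_sentences),
    model_type="bpe",
)
```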

Takeaways

  • Neural models are parametric models, and parametric models need their hyperparameters to be carefully chosen.

  • To achieve good performance in low-resource / limited training data scenarios, hyperparameter values need to be carefully set.
