Neural Machine Translation System (Pytorch)

State-of-the-art NMT systems are difficult to understand.

For the beginners, I highly recommend this project, which is a simplified version of Opennmt-py.

Though some modules reference to OpenNMT-py, most of the code (80%) is written by myself. With the scripts that I posted, you can even build your own NMT systems and evaluate them on WMT 2018 datasets.

REQUIREMENTS

(I implemented it with Pytorch 0.4. ) Python version >= 3.6 (recommended) Pytorch version >= 0.4 (recommended)

Usage

For training, please use a Moses-style configuration file to specify paths and hyper-parameters.

 python train.py --config config/nmt.ini

For translation,

python translate.py --config config/nmt.ini --checkpoint {pretrained_model.pt} -v

A known bug is that beam_search does not support batch decoding: when using beam_search = True, you need to set test_batch_size=1 to make the output correct.

For monotonic decoding (without beam_search), you can use any number for test_batch_size.

Optimizer

LSTM:

SGD 1.0 learning_rate_decay as 0.9 (recommended)
GRU:

Adam 1e-4 max_grad_norm = 5 (recommended)
Transformer:

Adam 1e-4, grad_accum_count = 4~5, label_smoothing=0.1 (recommended)

References

Vaswani, Ashish, et al. "Attention is all you need." Advances in Neural Information Processing Systems. 2017
Luong, Minh-Thang, Hieu Pham, and Christopher D. Manning. "Effective approaches to attention-based neural machine translation." arXiv preprint arXiv:1508.04025 (2015).
Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. "Neural machine translation by jointly learning to align and translate." arXiv preprint arXiv:1409.0473 (2014).

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
NMT		NMT
Utils		Utils
config		config
scripts		scripts
.gitignore		.gitignore
README.md		README.md
ensemble.py		ensemble.py
train.py		train.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Machine Translation System (Pytorch)

REQUIREMENTS

Usage

Optimizer

References

About

Releases

Packages

Languages

wang-h/pynmt

Folders and files

Latest commit

History

Repository files navigation

Neural Machine Translation System (Pytorch)

REQUIREMENTS

Usage

Optimizer

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages