deeppavlov/seq2seq_scg

Seq2Seq Neural Machine Translation model with different optimization techniques

This is our implementation of a seq2seq neural machine translation model with different optimization techniques. For details, see the corresponding NIPS'17 workshop paper.

Setting up

  1. Run the script (borrowed from the Harvard NLP repo) to download and preprocess the IWSLT'14 dataset:
$ cd preprocessing
$ source prepareData.sh

NOTE: this script requires Lua and luaTorch. As an alternative, you can download all the necessary files from this repo.

  2. Run this command to create vocab.bin:
$ python utils/vocab.py --train_src data/nmt_iwslt/train.de-en.de --train_tgt data/nmt_iwslt/train.de-en.en --output data/nmt_iwslt/vocab.bin
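The exact options of utils/vocab.py are defined in the repo; as a rough sketch, a vocabulary builder of this kind counts tokens in the training corpora and keeps the most frequent ones, reserving slots for special symbols (the function name, size, and special tokens below are illustrative assumptions):

```python
from collections import Counter

def build_vocab(sentences, size=50000, specials=("<pad>", "<s>", "</s>", "<unk>")):
    """Count whitespace tokens and keep the most frequent ones,
    reserving the first indices for special symbols."""
    counts = Counter(tok for sent in sentences for tok in sent.split())
    words = [w for w, _ in counts.most_common(size - len(specials))]
    return {w: i for i, w in enumerate(list(specials) + words)}

# Toy example on two German sentences.
vocab = build_vocab(["wir gehen nach hause", "wir gehen"])
```

In the actual script, both the source (`--train_src`) and target (`--train_tgt`) sides contribute to the vocabulary that is serialized to vocab.bin.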
  3. Download pretrained word vectors and run the preprocessing script:
$ mkdir data/fasttext
$ cd data/fasttext
$ wget https://s3-us-west-1.amazonaws.com/fasttext-vectors/wiki.de.vec
$ wget https://s3-us-west-1.amazonaws.com/fasttext-vectors/wiki.en.vec
$ python pretrained_embeddings.py
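pretrained_embeddings.py presumably restricts the full fastText files to the training vocabulary so that only the needed vectors are kept. A minimal sketch of that filtering step (the function name and output format are assumptions, not the repo's actual code):

```python
import io

def filter_vectors(vec_lines, vocab):
    """Keep only the embedding rows whose word appears in `vocab`.
    Each data line of a fastText .vec file is: word v1 v2 ... vN.
    (Real .vec files start with a 'count dim' header line, whose
    first token is a number and so never matches a vocabulary word.)"""
    kept = {}
    for line in vec_lines:
        parts = line.rstrip().split(" ")
        word = parts[0]
        if word in vocab:
            kept[word] = [float(x) for x in parts[1:]]
    return kept

# Tiny in-memory example instead of the real wiki.en.vec file.
fake_vec = io.StringIO("the 0.1 0.2\ncat 0.3 0.4\nxyzzy 0.5 0.6\n")
vectors = filter_vectors(fake_vec, vocab={"the", "cat"})
```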

Usage

  1. Specify train and model parameters in utils/param_provider.py
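The actual parameter names live in utils/param_provider.py; purely as a hypothetical illustration of the kind of settings such a module centralizes:

```python
# Hypothetical sketch of a parameter provider; the real names and
# values in utils/param_provider.py may differ.
def get_train_params():
    return {
        "batch_size": 64,
        "learning_rate": 1e-3,
        "num_epochs": 20,
        "start_model": None,  # path to a checkpoint to evaluate or resume from
    }

def get_model_params():
    return {
        "embedding_dim": 300,  # matches the fastText vector dimension
        "hidden_dim": 512,
        "dropout": 0.2,
    }
```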

  2. Run training

To train on CPU, simply run train.py:

$ python train.py

To train on GPU:

$ source train_on_gpu.sh gpu_id

For example, to run on the GPU with id 0:

$ source train_on_gpu.sh 0

By default, the train.py script does not evaluate the model during training. To enable evaluation, run it with the --do_eval flag:

$ python train.py --do_eval
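A --do_eval style option is typically a boolean argparse switch; a minimal sketch of how train.py might parse it (illustrative only, not the repo's actual argument list):

```python
import argparse

parser = argparse.ArgumentParser(description="Train the seq2seq model.")
parser.add_argument("--do_eval", action="store_true",
                    help="also evaluate the model during training")

# Simulate `python train.py --do_eval`.
args = parser.parse_args(["--do_eval"])
```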

Alternatively, run the following script to train and evaluate on GPU:

$ source train_and_eval_on_gpu.sh gpu_id
  3. Evaluate your model on the test dataset

First, specify the model you want to evaluate as a 'start_model' in utils/param_provider.py.

Then, run the previously trained model in inference mode:

$ python inference.py

Or, to run inference on GPU:

$ source inference_on_gpu.sh gpu_id

Finally, run this command to compute the BLEU score:

$ python compute_bleu.py
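The exact implementation of compute_bleu.py lives in the repo; for reference, BLEU combines modified n-gram precisions with a brevity penalty. A self-contained, single-reference, unsmoothed sentence-level sketch:

```python
import math
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(reference, hypothesis, max_n=4):
    """Plain BLEU: geometric mean of modified n-gram precisions
    (n = 1..max_n) times a brevity penalty; no smoothing."""
    ref, hyp = reference.split(), hypothesis.split()
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = Counter(ngrams(hyp, n))
        ref_counts = Counter(ngrams(ref, n))
        # Clip each hypothesis n-gram count by its reference count.
        overlap = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = max(sum(hyp_counts.values()), 1)
        if overlap == 0:
            return 0.0  # any zero precision makes unsmoothed BLEU zero
        log_precisions.append(math.log(overlap / total))
    bp = 1.0 if len(hyp) >= len(ref) else math.exp(1 - len(ref) / len(hyp))
    return bp * math.exp(sum(log_precisions) / max_n)

# A hypothesis identical to the reference scores 1.0.
score = bleu("the cat is on the mat", "the cat is on the mat")
```

Corpus-level BLEU (as reported in MT papers) aggregates the n-gram counts over all sentences before taking the geometric mean, rather than averaging sentence scores.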