Multilingual Neural Machine Translation with Soft Decoupled Encoding

This is the code we used in our paper

Multilingual Neural Machine Translation with Soft Decoupled Encoding

Xinyi Wang, Hieu Pham, Philip Arthur, Graham Neubig

Requirements

Python 3.6, PyTorch 0.4.1

All the scripts for experiments in the paper can be created from the templates under scripts/template/

Data Processing

The data we use is multilingual TED corpus by Qi et al.

We provide preprocessed version of the data, which you can get from here: If you are interested int the details of data processing, you can take a look at the script make-eng.sh and make-data.sh.

Training:

The template name for the following methods are:

SDE: bi-semb-bq-o32000
subword: bi-sw-32000
subword-joint: bi-sw-joint-32000
word: bi-w-64000

To make the main experiment scripts for alll 4 languages tested in the paper, simply call bash make-cfg.sh

Decoding:

To make decode scripts, simply use the file make-trans.py. Change the name of the directory where the experiment outputs are stored if you modify the template scripts during training. Otherwise it should just work by calling: python make-trans.py

Implementation details

If you are interested in the implementation of SDE: All the components of SDE is implemented in a encoder class here. It is a RNN encoder that encodes words using SDE.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
scripts/template		scripts/template
src		src
.gitignore		.gitignore
README.md		README.md
cut-corpus.py		cut-corpus.py
make-abl-cfg.sh		make-abl-cfg.sh
make-cfg.sh		make-cfg.sh
make-data.sh		make-data.sh
make-eng.sh		make-eng.sh
make-trans-cfg.sh		make-trans-cfg.sh
make-trans.py		make-trans.py
submit-train-0.sh		submit-train-0.sh
submit-train-1.sh		submit-train-1.sh
submit-train-2.sh		submit-train-2.sh
submit-train-3.sh		submit-train-3.sh
submit-train.sh		submit-train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multilingual Neural Machine Translation with Soft Decoupled Encoding

Requirements

Data Processing

Training:

Decoding:

Implementation details

About

Releases

Packages

Languages

cindyxinyiwang/SDE

Folders and files

Latest commit

History

Repository files navigation

Multilingual Neural Machine Translation with Soft Decoupled Encoding

Requirements

Data Processing

Training:

Decoding:

Implementation details

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages