This is the repo for the paper "Exploiting Curriculum Learning in Unsupervised Neural Machine Translation" (to appear in Findings of EMNLP 2021).
This paper exploits curriculum learning (CL) in unsupervised neural machine translation (UNMT). Specifically, we design methods to estimate the quality of pseudo bi-text and apply a CL framework to improve UNMT. Please refer to the paper for more details.
- Python 3
- NumPy
- PyTorch
- fastBPE (generate and apply BPE codes)
- Moses (scripts to clean and tokenize text only - no installation required)
- Apex (for fp16 training)
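A possible setup sketch (the commands and sources below are illustrative; follow each project's own instructions, especially for compiling fastBPE and Apex):

```bash
# Illustrative dependency setup; see each project's README for details.
pip install numpy torch
git clone https://github.com/glample/fastBPE         # generate/apply BPE codes
git clone https://github.com/moses-smt/mosesdecoder  # tokenization/cleaning scripts only
git clone https://github.com/NVIDIA/apex             # fp16 training
```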
Difficulty computation requires cross-lingual word embeddings, which are obtained with the unsupervised training method MUSE. Alternatively, you can use the cross-lingual distances of word pairs that we extracted (stored in the CL_diff/data directory). Then you can run the following command to compute the difficulty file for your training data:
python CL_diff/compute_tfidf_wordtrans_diff.py <DISTANCE_FILE> <TRAINING_DATA_FILE> <OUTPUT_FILE>
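For intuition, here is a minimal sketch of the idea behind this step: score each training sentence by the TF-IDF-weighted average cross-lingual distance of its words, so that higher scores mean harder sentences. The file formats, the smoothed IDF, and the default distance for unseen words are assumptions for illustration, not the actual implementation of CL_diff/compute_tfidf_wordtrans_diff.py.

```python
import math
import sys
from collections import Counter

def load_distances(path):
    """Assumed format: one 'word ... distance' entry per line."""
    dist = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.split()
            if len(parts) >= 2:
                dist[parts[0]] = float(parts[-1])
    return dist

def sentence_difficulties(dist_file, train_file, out_file):
    dist = load_distances(dist_file)
    with open(train_file, encoding="utf-8") as f:
        sentences = [line.split() for line in f]

    # Document frequencies over the training corpus, for smoothed IDF.
    df = Counter()
    for toks in sentences:
        df.update(set(toks))
    n = len(sentences)

    with open(out_file, "w", encoding="utf-8") as out:
        for toks in sentences:
            tf = Counter(toks)
            num = den = 0.0
            for w in toks:
                # TF-IDF weight of the word within this sentence.
                weight = (tf[w] / len(toks)) * math.log(1 + n / (1 + df[w]))
                num += weight * dist.get(w, 1.0)  # unseen words: max distance 1.0
                den += weight
            out.write(f"{num / den if den else 1.0}\n")

if __name__ == "__main__":
    sentence_difficulties(sys.argv[1], sys.argv[2], sys.argv[3])
```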
This repo is built on the XLM toolkit and MASS. You can run the model with the following commands.
For XLM:
bash CL_XLM/run_unmt_ende.sh
For MASS:
bash CL_MASS/run_unmt_enro.sh
If you have multiple GPUs, please modify the scripts according to the XLM README; a sketch is shown below.
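For example, the XLM README launches multi-GPU training through `torch.distributed.launch`; a minimal sketch (the GPU count and training arguments are illustrative):

```bash
# Run the trainer on 8 GPUs (adapt NGPU and pass the same train.py
# arguments that the run_unmt_*.sh scripts use).
export NGPU=8
python -m torch.distributed.launch --nproc_per_node=$NGPU train.py
```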
For en-de, en-fr, and en-ro, please download the pretrained models from the XLM README and MASS README.
For en-zh, our model can be downloaded through the following link.
Link | Password
---|---
https://pan.baidu.com/s/1vTQDjWF119EITVIHew-leA | tkvn
@inproceedings{lu2021,
  title={Exploiting Curriculum Learning in Unsupervised Neural Machine Translation},
  author={Lu, Jinliang and Zhang, Jiajun},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2021},
  year={2021}
}