GitHub - ghchen18/leca: Code for Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation

Code for paper - Lexical-constrained-aware neural machine translation

Install and Data preprocess

The code is implemented on fairseq v0.6.1, follow the same steps to install and prepare the processed fairseq dataset, the WMT process script is here.

Step 1: Install fairseq.

## you may want to build a conda environment first.
git clone https://github.com/ghchen18/leca.git
cd leca
pip install --editable .

Step 2: Process dataset

Follow the steps in the fairseq repo. More dataset can be found in WMT Translation Task. Because of the difference between the used dictionaries, the data preprocessing should use the preprocess.py in this repo instead of the official fairseq repo.

Run experiment

See scripts/run.sh. You may need to revise the variables in the shell scripts first according to your case.

Citation

@inproceedings{chen2020leca,
  title     = {Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation},
  author    = {Chen, Guanhua and Chen, Yun and Wang, Yong and Li, Victor O.K.},
  booktitle = {Proceedings of {IJCAI} 2020: Main track},          
  pages     = {3587--3593},
  year      = {2020},
  month     = {7},
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
docs		docs
fairseq		fairseq
scripts		scripts
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PATENTS		PATENTS
README.md		README.md
eval_lm.py		eval_lm.py
fairseq.gif		fairseq.gif
fairseq_logo.png		fairseq_logo.png
generate.py		generate.py
interactive.py		interactive.py
preprocess.py		preprocess.py
score.py		score.py
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Install and Data preprocess

Run experiment

Citation

About

Releases

Packages

Languages

License

ghchen18/leca

Folders and files

Latest commit

History

Repository files navigation

Install and Data preprocess

Run experiment

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages