ufal/nmt-pe-effects-2021

Neural Machine Translation Quality and Post-Editing Performance

This repository contains an experiment relating NMT quality to post-editing effort, presented at EMNLP 2021 (presentation recording). Please cite the following paper if you use this research:

@inproceedings{zouhar2021neural,
  title={Neural Machine Translation Quality and Post-Editing Performance},
  author={Zouhar, Vil{\'e}m and Popel, Martin and Bojar, Ond{\v{r}}ej and Tamchyna, Ale{\v{s}}},
  booktitle={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing},
  pages={10204--10214},
  year={2021},
  url={https://aclanthology.org/2021.emnlp-main.801/}
}

You can access the data on Hugging Face:

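# Requires the Hugging Face `datasets` package.
from datasets import load_dataset

# Each phase of the experiment is a separate configuration of the same dataset.
data_p1 = load_dataset("zouharvi/nmt-pe-effects", "phase_1")
data_p2 = load_dataset("zouharvi/nmt-pe-effects", "phase_2")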
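The split names and columns of the two phases are not listed here, so it can help to look at the loaded objects before using them. The following is a minimal sketch, not part of the repository: `load_phases` and `summarize_splits` are hypothetical helper names, and the sketch assumes that `load_dataset` returns a `DatasetDict`-like mapping of split names to splits exposing `num_rows` and `column_names`.

```python
def load_phases(repo_id="zouharvi/nmt-pe-effects", phases=("phase_1", "phase_2")):
    """Load each phase configuration; needs network access and the `datasets` package."""
    from datasets import load_dataset  # imported lazily so the helper below works offline

    return {phase: load_dataset(repo_id, phase) for phase in phases}


def summarize_splits(dataset_dict):
    """Return {split_name: (number_of_rows, column_names)} without assuming any schema."""
    return {
        name: (split.num_rows, list(split.column_names))
        for name, split in dataset_dict.items()
    }
```

With the data loaded as above, `summarize_splits(data_p1)` lists whatever splits and columns the first phase actually provides, which avoids hard-coding names that this README does not document.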

Documents

| Document name | Domain | Lines | Source words | Description |
|---|---|---|---|---|
| Hole | news | 10 | 272 | sky.com.20667 |
| Whistle | news | 6 | 219 | nytimes.231903 |
| China | news | 12 | 357 | en.ndtv.com.13143 |
| Turner | news | 8 | 303 | euronews-en.185744 |
| QTLeap | technical | 11 | 178 | Batch2q_cs_v1010, Batch1a_en_v1.4.NAF |
| Lease | agreement | 29 | 597 | Translated by Ondřej |
| Audit (I)ntroduction | audit | 17 | 433 | Provided by NKÚ |
| Audit (R)ecommendation | audit | 6 | 121 | Provided by NKÚ |
| Total | | 99 | 2480 | |
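As a quick consistency check, the totals in the table can be recomputed from the individual rows and broken down by domain. This is only an illustrative sketch: the tuples are copied from the table, `totals_by_domain` is a hypothetical helper, and reading the "Lease" row as a document named "Lease" in the "agreement" domain is an assumption about the column layout.

```python
from collections import defaultdict

# (document name, domain, lines, source words), copied from the table above.
# Assumption: "Lease" is the document name and "agreement" its domain.
DOCUMENTS = [
    ("Hole", "news", 10, 272),
    ("Whistle", "news", 6, 219),
    ("China", "news", 12, 357),
    ("Turner", "news", 8, 303),
    ("QTLeap", "technical", 11, 178),
    ("Lease", "agreement", 29, 597),
    ("Audit (I)ntroduction", "audit", 17, 433),
    ("Audit (R)ecommendation", "audit", 6, 121),
]


def totals_by_domain(documents):
    """Sum lines and source words per domain, returning {domain: (lines, words)}."""
    totals = defaultdict(lambda: [0, 0])
    for _name, domain, lines, words in documents:
        totals[domain][0] += lines
        totals[domain][1] += words
    return {domain: tuple(counts) for domain, counts in totals.items()}


total_lines = sum(lines for _, _, lines, _ in DOCUMENTS)
total_words = sum(words for _, _, _, words in DOCUMENTS)
print(total_lines, total_words)  # 99 2480, matching the Total row
print(totals_by_domain(DOCUMENTS))
```

The per-domain breakdown shows that the four news documents account for 36 lines and 1151 source words, a little under half of the source words.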

Acknowledgement

We sincerely thank České překlady for their collaboration and for providing all translations and revisions. The work was supported by Memsource and by grants 19-26934X (NEUREM3) and 20-16819X (LUSyD) of the Czech Science Foundation. This work used language resources developed and distributed by the LINDAT/CLARIAH-CZ project of the Ministry of Education, Youth and Sports of the Czech Republic (project LM2018101).

Related References

  1. https://www.cs.jhu.edu/~phi/publications/machine-translation-quality.pdf
  2. https://www.dlsi.ua.es/~fsanchez/pub/pdf/forcada17a.pdf
