This repository includes the data sets for idiom translation from German to English and English to German proposed in our paper.
If you use this data set, please cite:
@ARTICLE{2018arXiv180204681F,
author = {{Fadaee}, M. and {Bisazza}, A. and {Monz}, C.},
title = "{Examining the Tip of the Iceberg: A Data Set for Idiom Translation}",
journal = {ArXiv e-prints},
archivePrefix = "arXiv",
eprint = {1802.04681},
primaryClass = "cs.CL",
keywords = {Computer Science - Computation and Language},
year = 2018,
month = feb,
adsurl = {http://adsabs.harvard.edu/abs/2018arXiv180204681F},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
This data set is originally from the WMT shared task 2017:
- Findings of the 2017 Conference on Machine Translation, Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shujian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia and Marco Turchi