Data and source code accompanying the paper "Split and Rephrase: Better Evaluation and a Stronger Baseline".
Roee Aharoni and Yoav Goldberg, ACL 2018
The data and some of the scripts are based on the repository by Narayan et al.: https://github.com/shashiongithub/Split-and-Rephrase
This repository includes:
The proposed data split, under
Training our proposed models using openNMT-py (under
Evaluating the models as proposed by Narayan et al., 2017 (under
Creating the RDF-based data split to reduce overlap between the development and test set found in the original split (under
Feel free to reach out in
firstname.lastname@example.org if you have any further questions!