This repository is about Diamonds in the Rough: Generating Fluent Sentences from Early-Stage Drafts for Academic Writing Assistance. Specifically, this repository includes:
- Set of Modiﬁed Incomplete TecHnical paper sentences (SMITH) - an evaluation dataset of pairs of draft sentences and their ﬁnal versions
- Synthetic training dataset - a dataset including synthetic draft sentences used for training the baseline models
For the details, see the paper.