Skip to content
Large corpus of uncompressed and compressed sentences from news articles.
Branch: master
Clone or download
Latest commit f03882c Apr 11, 2017
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data Added the training portion of the dataset Apr 11, 2017
README.md

README.md

sentence-compression

Large corpus of uncompressed and compressed sentences from news articles.

The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

The algorithm to collect the data is described here: Overcoming the Lack of Parallel Data in Sentence Compression, Katja Filippova and Yasemin Altun, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP '13), pp. 1481-1491. (pdf)

You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.