Grammarly Corpus of Discourse Coherence and accompanying code for discourse coherence models

Grammarly Corpus of Discourse Coherence (GCDC)

A description of this corpus and the accompanying code can be found in the following paper:

Discourse Coherence in the Wild: A Dataset, Evaluation and Methods
Alice Lai and Joel Tetreault
To appear in the Proceedings of the 19th Annual SIGDIAL Meeting on Discourse and Dialogue (SIGDIAL 2018)

GCDC was created in part using the Yahoo Answers corpus: L6 - Yahoo! Answers Comprehensive Questions and Answers version 1.0. The Yahoo Answers corpus can be requested free of charge for research purposes. To access GCDC (and the accompanying code), users must first gain access to this Yahoo Answers corpus.

Once you have gained access to the L6 corpus, please forward the acknowledgment to Joel Tetreault, along with your affiliation and a short description of how you will use the data. We will then provide access to the Grammarly Corpus of Discourse Coherence and the accompanying code. Please let us know if you have any questions.