Skip to content
Go to file

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

Grammarly Corpus of Discourse Coherence (GCDC)

Description of this corpus and the accompanying code can be found in the following paper:

Discourse Coherence in the Wild: A Dataset, Evaluation and Methods
Alice Lai ( and Joel Tetreault (
Proceedings of the 19th Annual SIGDIAL Meeting on Discourse and Dialogue (SIGDIAL 2018)

GCDC was created in part using the Yahoo Answers corpus: L6 - Yahoo! Answers Comprehensive Questions and Answers version 1.0. The Yahoo Answers corpus can be requested free of charge for research purposes. Access to GCDC (and the accompanying code) will require users to first gain access to this Yahoo Answers corpus.

Once you have gained access to the L6 corpus, please forward the acknowledgment to Joel Tetreault (, along with your affiliation and a short description of how you will be using the data, and we will provide access to the Grammarly Corpus of Discourse Coherence and accompanying code. Please let us know if you have any questions.


Grammarly Corpus of Discourse Coherence and accompanying code for discourse coherence models



No releases published


No packages published
You can’t perform that action at this time.