Description of this corpus and the accompanying code can be found in the following paper:
Discourse Coherence in the Wild: A Dataset, Evaluation and Methods
Alice Lai (aylai2@illinois.edu) and Joel Tetreault (joel.tetreault@grammarly.com)
Proceedings of the 19th Annual SIGDIAL Meeting on Discourse and Dialogue (SIGDIAL 2018)
GCDC was created in part using the Yahoo Answers corpus: L6 - Yahoo! Answers Comprehensive Questions and Answers version 1.0. The Yahoo Answers corpus can be requested free of charge for research purposes. Access to GCDC (and the accompanying code) will require users to first gain access to this Yahoo Answers corpus.
Once you have gained access to the L6 corpus, please forward the acknowledgment to Joel Tetreault (tetreaul@gmail.com), along with your affiliation and a short description of how you will be using the data, and we will provide access to the Grammarly Corpus of Discourse Coherence and accompanying code. Please let us know if you have any questions.