To download the data, go to https://cs.nyu.edu/~kcho/DMQA/. Once the files have been split, use Stanford CoreNLP to get the XML representation:

./corenlp.sh -annotators tokenize,ssplit,pos,lemma,ner,parse,depparse,coref -coref.algorithm neural -filelist path/to/filelist.txt -outputFormat xml -outputDirectory /path/to/output/xml
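The `-filelist` argument expects a text file with one input path per line. A minimal sketch for generating it is shown below. The helper name `write_filelist` and the `*.story` pattern are assumptions for illustration, not part of this repository; adjust the pattern to match however your split files are named.

```python
from pathlib import Path


def write_filelist(story_dir, filelist_path, pattern="*.story"):
    """Write one absolute path per line for every file matching `pattern`.

    `pattern` is an assumed naming scheme; change it to fit your split files.
    Returns the number of paths written.
    """
    paths = sorted(
        p.resolve() for p in Path(story_dir).glob(pattern) if p.is_file()
    )
    Path(filelist_path).write_text(
        "".join(f"{p}\n" for p in paths), encoding="utf-8"
    )
    return len(paths)


# Example usage (paths are placeholders):
# write_filelist("path/to/split/files", "path/to/filelist.txt")
```

Absolute paths are written so that the list does not depend on the directory `corenlp.sh` is launched from.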

Then, to process the data, run:

python convert2graph.py /path/to/output/xml /path/to/summaries /path/to/output
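Before running the conversion over the full output directory, it can help to sanity-check one of the CoreNLP XML files. The sketch below is not part of this repository and says nothing about how `convert2graph.py` works internally; it only assumes the standard CoreNLP XML layout (`root/document/sentences/sentence/tokens/token` and `root/document/coreference/coreference`). The function name `summarize_corenlp_xml` is hypothetical.

```python
import xml.etree.ElementTree as ET


def summarize_corenlp_xml(xml_data):
    """Count sentences, tokens, and coreference chains in CoreNLP XML.

    `xml_data` may be a str or bytes. Explicit paths are used instead of
    root.iter("sentence") because coreference mentions also contain a
    <sentence> child (a sentence index), which would inflate the count.
    """
    root = ET.fromstring(xml_data)
    sentences = root.findall("./document/sentences/sentence")
    words = [
        [tok.findtext("word", default="") for tok in s.findall("./tokens/token")]
        for s in sentences
    ]
    chains = root.findall("./document/coreference/coreference")
    return {
        "sentences": len(sentences),
        "tokens": sum(len(w) for w in words),
        "coref_chains": len(chains),
        "first_sentence": words[0] if words else [],
    }


# Example usage (path is a placeholder):
# from pathlib import Path
# print(summarize_corenlp_xml(Path("/path/to/output/xml/some_file.xml").read_bytes()))
```

If the sentence or coreference counts come back as zero, the corresponding annotators probably did not run, and it is worth rechecking the `corenlp.sh` invocation before converting.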