We are building a Russian Drama Corpus with files encoded in TEI-P5. Our corpus comprises 68 plays so far, stemming from ilibrary, Wikisource and РВБ, converted into TEI and corrected by us. There will be more.
If you just want to download the corpus with TEI files, do this:
svn export https://github.com/lehkost/RusDraCor/trunk/tei
RusDraCor will first be presented on June 29, 2017, at the Corpora 2017 conference in St. Petersburg (our slides here) and on July 11, 2017, at the "Digitizing the stage" conference in Oxford. The social network data we extracted so far can be explored with our Shinyapp.