Releases: redewiedergabe/corpus
Final release - core corpus + additional material
First public release of the "Redewiedergabe" corpus
This beta release contains the first subset of the "Redewiedergabe" corpus.
It includes 619 text samples totaling 360,974 tokens. A total of 9,451 instances of STWR (speech, thought and writing representation) have been annotated, along with additional information such as frames, introductory expressions, and speakers.
Available formats are TEI-compatible XML and a column-based text format.
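As a rough illustration of how a column-based file might be read, here is a minimal Python sketch. It assumes a tab-separated layout with one token per row and a header row naming the columns; the release notes do not specify the delimiter or the column names, so `read_column_file`, `tok`, and `stwr` are hypothetical and should be adjusted to match the actual files.

```python
import csv
import io


def read_column_file(handle, delimiter="\t"):
    """Yield one dict per row, keyed by the names found in the header row.

    Assumption: the file is delimiter-separated with a header line.
    Check the corpus documentation for the real layout.
    """
    # QUOTE_NONE keeps quotation marks in the text as literal characters,
    # which matters for a corpus of speech and thought representation.
    reader = csv.DictReader(handle, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    for row in reader:
        yield row


# Hypothetical sample data; the column names are illustrative only.
sample = "tok\tstwr\nEr\t-\nsagte\t-\n"
rows = list(read_column_file(io.StringIO(sample)))
```

To read a real file, pass an open file handle instead of the `io.StringIO` sample, for example `open(path, encoding="utf-8", newline="")`.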