Summary

UD_Czech-Poetry contains random samples of 19th-century Czech poetry from the Corpus of Czech Verse, parsed with UDPipe 2 (trained on UD Czech-PDT 2.11) and manually corrected.

Introduction

The treebank consists of 29 randomly selected poems from the Corpus of Czech Verse, parsed with UDPipe 2 (trained on UD Czech-PDT 2.11) and manually corrected to comply with the UD release of the FicTree treebank.
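UD treebanks are distributed in the CoNLL-U format, so the data can be inspected with a few lines of standard-library Python. The following is a minimal sketch, not part of the treebank tooling: the helper names `read_conllu` and `upos_counts` are hypothetical, and the sample sentence is invented for illustration and is not taken from the treebank.

```python
from collections import Counter

# Invented one-sentence sample in CoNLL-U (10 tab-separated columns per token).
# It is NOT taken from the treebank; it only illustrates the file layout.
SAMPLE = (
    "# text = Růže kvete.\n"
    "1\tRůže\trůže\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tkvete\tkvést\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)


def read_conllu(lines):
    """Yield each sentence as a list of token rows (lists of column strings)."""
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
        elif line.startswith("#"):
            # Sentence-level comments (sent_id, text, ...) are skipped here.
            continue
        else:
            cols = line.split("\t")
            # Keep only regular tokens; skip multiword ranges such as "1-2"
            # and empty nodes such as "1.1", whose IDs are not plain integers.
            if cols[0].isdigit():
                sentence.append(cols)
    if sentence:
        yield sentence


def upos_counts(lines):
    """Count UPOS tags (column 4) over all regular tokens."""
    return Counter(tok[3] for sent in read_conllu(lines) for tok in sent)


sentences = list(read_conllu(SAMPLE.splitlines()))
counts = upos_counts(SAMPLE.splitlines())
print(len(sentences), dict(counts))
```

To run the same functions on the treebank itself, pass an open file object instead of `SAMPLE.splitlines()`, for example `open(path, encoding="utf-8")`, where `path` points to a `.conllu` file from this repository.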

Acknowledgments

This work was supported by the Czech Science Foundation grant No. 23-07727S, European Poetry: Distant Reading. This work has used the tools and data provided by the LINDAT/CLARIAH-CZ project (LM2023062; formerly LM2010013, LM2015071, LM2018101), supported by the Ministry of Education, Youth and Sports of the Czech Republic under the programme LM of "Large Infrastructures".

References

  • Jelínek, Tomáš / Zeman, Daniel (2022): UD_Czech-FicTree (v2.10). https://github.com/UniversalDependencies/UD_Czech-FicTree
  • Jelínek, Tomáš (2017): "FicTree: A Manually Annotated Treebank of Czech Fiction", in: ITAT (= CEUR Workshop Proceedings). CEUR-WS.org. 181–185.
  • Plecháč, Petr / Kolár, Robert (2015): "The corpus of Czech verse", in: Studia metrica et poetica 2 (1): 107–118.

Changelog

  • 2023-11-15 v2.13
    • Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.13
License: CC BY-SA 4.0
Includes text: yes
Genre: poetry
Lemmas: manual native
UPOS: manual native
XPOS: automatic
Features: manual native
Relations: manual native
Contributors: Cinková, Silvie
Contributing: here
Contact: cinkova@ufal.mff.cuni.cz
===============================================================================