A small treebank of Upper Sorbian based mostly on Wikipedia.
The Upper Sorbian sentences are taken from the W2C corpus (Martin Majliš). They were then manually filtered and morphologically and syntactically annotated by Dan Zeman; lemmatization was done by Anna Nedoluzhko.
Sentences in the W2C corpus are shuffled.
- 2019-05-15 v2.4
- goeswith no longer used in situations with punctuation or gaps.
- 2018-11-15 v2.3
- Fixed some UPOS-DEPREL mismatches.
- 2018-04-15 v2.2
- Repository renamed from UD_Upper_Sorbian to UD_Upper_Sorbian-UFAL.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.1
License: CC BY-SA 4.0
Includes text: yes
Genre: wiki nonfiction
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Zeman, Daniel; Nedoluzhko, Anna
Contributing: here
Contact: firstname.lastname@example.org
===============================================================================