Upper Sorbian data.
CONTRIBUTING.md
LICENSE.txt
README.md
eval.log
hsb_ufal-ud-test.conllu
hsb_ufal-ud-train.conllu
stats.xml


Summary

A small treebank of Upper Sorbian based mostly on Wikipedia.

Introduction

The Upper Sorbian sentences are taken from the W2C corpus (Martin Majliš). They were manually filtered and then morphologically and syntactically annotated by Dan Zeman; lemmatization was done by Anna Nedoluzhko.

Sentences in the W2C corpus are shuffled.
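
The annotation layers described above (lemmas, part-of-speech tags, dependency relations) are stored in the repository's `.conllu` files. As a rough sketch, assuming the standard CoNLL-U layout of ten tab-separated columns per token, the data could be read as shown below. The `read_conllu` helper and the `FIELDS` names are illustrative only and not part of this repository, and the sample sentence is an invented English placeholder, not treebank data. The sketch also does not treat multiword-token ranges or empty nodes specially.

```python
# Column names follow the CoNLL-U column order; the lowercase keys are our own choice.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def read_conllu(lines):
    """Yield each sentence as a list of token dicts (hypothetical helper)."""
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
        elif line.startswith("#"):
            # Sentence-level comments such as "# text = ..." are skipped here.
            continue
        else:
            sentence.append(dict(zip(FIELDS, line.split("\t"))))
    if sentence:
        yield sentence


# Invented placeholder sentence, NOT taken from the treebank.
sample = "\n".join([
    "# text = Dogs bark.",
    "\t".join(["1", "Dogs", "dog", "NOUN", "_", "Number=Plur", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "bark", "bark", "VERB", "_", "_", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    "",
])

sentences = list(read_conllu(sample.splitlines()))
lemmas = [token["lemma"] for token in sentences[0]]
```

To run the same helper on the real data, one would pass an open file object instead of the sample, for example `open("hsb_ufal-ud-test.conllu", encoding="utf-8")`.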

Changelog

  • 2018-11-15 v2.3
    • Fixed some UPOS-DEPREL mismatches.
  • 2018-04-15 v2.2
    • Repository renamed from UD_Upper_Sorbian to UD_Upper_Sorbian-UFAL.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.1
License: CC BY-SA 4.0
Includes text: yes
Genre: wiki nonfiction
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Zeman, Daniel; Nedoluzhko, Anna
Contributing: here
Contact: zeman@ufal.mff.cuni.cz
===============================================================================
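
The metadata block above is a plain list of `Key: value` lines between two delimiter lines that begin with `===`. A minimal sketch of how a script might read it follows. The function name `parse_ud_metadata` is hypothetical, and the parsing rules are inferred only from the layout shown here, not from any official tool.

```python
def parse_ud_metadata(readme_text):
    """Return the key/value pairs found in the machine-readable block."""
    metadata = {}
    inside = False
    for line in readme_text.splitlines():
        if line.startswith("==="):
            if inside:
                break  # closing delimiter reached
            # Only the delimiter that names the block opens it.
            inside = "Machine-readable metadata" in line
            continue
        if inside and ":" in line:
            # Split on the first colon only, so values may contain colons.
            key, value = line.split(":", 1)
            metadata[key.strip()] = value.strip()
    return metadata


# A shortened copy of the block from this README.
sample = """\
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.1
License: CC BY-SA 4.0
Genre: wiki nonfiction
Contributors: Zeman, Daniel; Nedoluzhko, Anna
===============================================================================
"""

meta = parse_ud_metadata(sample)
# Contributors are separated by semicolons; each name is "Surname, Given".
contributors = [name.strip() for name in meta["Contributors"].split(";")]
genres = meta["Genre"].split()
```

Splitting on the first colon keeps the sketch tolerant of values that themselves contain a colon.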