Skip to content
Permalink
Branch: master
Find file Copy path
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
50 lines (41 sloc) 1.86 KB

Summary

UD Karelian-KKPP is a manually annotated new corpus of Karelian made in Universal dependencies annotation scheme. The data is collected from VepKar corpora and consists of mostly modern news texts but also some stories and educational texts.

Introduction

UD Karelian-KKPP is a manually annotated new corpus of Karelian made in Universal dependencies annotation scheme. The data is collected from VepKar corpora and consists of mostly modern news texts but also some stories and educational texts. We have based many decisions in the dependency annotations on pre-existing Finnish treebanks. The morphological annotations and grammar are based on the refered books (Zaikov, Ahtia) and the Karelian dictionary, with necessary orthographical modernisations.

Acknowledgments

Finnish treebank developers for a good reference treebank and also SETS dep search which has been very useful in finding equivalent examples from Finnish treebanks.

References

  • Zaikov, Pekka. Vienankarjalan kielioppi. Lisänä harjotukšie ta lukemisto (2013).
  • Ahtia, Edvard Vilhelm. Karjalan kielioppi. Karjalan Kansalaisseura, (1938).
  • Karjalan kielen verkkosanakirja
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.4
License: CC BY-SA 4.0
Includes text: yes
Genre: nonfiction news web
Lemmas: manual native
UPOS: manual native
XPOS: not available
Features: manual native
Relations: manual native
Contributors: Pirinen, Tommi A
Contributing: elsewhere
Contact: tommi.antero.pirinen@uni-hamburg.de
===============================================================================
You can’t perform that action at this time.