UD_Abkhaz-AbNC is a treebank based on texts from the Abkhaz National Corpus, AbNC.
UD_Abkhaz-AbNC is a treebank based on texts from the Abkhaz National Corpus, AbNC, which is a corpus of written texts from a variety of genres. The sentences for the initial release of the treebank are taken from a collection of fairy tales for children (Аҧсуа лакәқәа – Ахәыҷқәа рзы, editor: Мықәба, А.).
The sentences are analysed using a finite state morphological analyser, and Constraint Grammar rules for disambiguation and dependency parsing. Both disambiguation and dependency analyses are corrected manually in a tool specifically developed for that purpose.
The Abkhaz treebank and the tools used to create it have been developed by Paul Meurer.
I am grateful to Saida Adzhindzhal (Suchum) for helping me understanding some of the constructions in the texts.
- Paul Meurer. A finite state approach to Abkhaz morphology and stress. Lecture Notes in Computer Science 2011, Volume 6618. pp. 271-282.
- Paul Meurer. The Abkhaz National Corpus. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan 2018. pp. 2456–2460.
- Paul Meurer. Towards a Treebank of Abkhaz. The AbNC, Analysing Abkhaz, and the Importance of Good Tools. Digital Kartvelology, Volume II (2023).
- 2024-05-15 v2.14
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.14 License: CC BY-SA 4.0 Includes text: yes Genre: fiction Lemmas: manual native UPOS: manual native XPOS: not available Features: manual native Relations: manual native Contributors: Meurer, Paul Contributing: here Contact: paul.meurer@uib.no ===============================================================================