UD Breton-KEB is a treebank of Breton that has been manually annotated according to the Universal Dependencies guidelines. The tokenisation guidelines and morphological annotation come from a finite-state morphological analyser of Breton released as part of the Apertium project.
The text in the corpus comes from a variety of sources: grammar-book sentences, sentences from the Breton Wikipedia, administrative texts from Ofis Publik ar Brezhoneg, and short prose texts from Bremaik. In addition, there are two traditional songs.
Of the 888 sentences, 256 have French translations, 275 have English translations, and 347 have French translations that were produced by machine translation from Breton using Apertium.
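Counts like these can be rechecked from the CoNLL-U files by tallying sentence-level comment lines. The following is a minimal sketch, not part of the treebank tooling: it assumes translations are stored as `# key = value` comments, and the key name `text_en` and the two-sentence sample are invented for illustration. Check the actual files for the keys they use.

```python
from collections import Counter


def count_sentence_comments(conllu_text):
    """Return (sentence count, Counter of comment keys) for CoNLL-U text.

    Each key is counted at most once per sentence, so the counter gives
    the number of sentences that carry a given comment.
    """
    n_sentences = 0
    key_counts = Counter()
    keys = set()
    has_content = False
    # A trailing empty string flushes the last sentence if the text
    # does not end with a blank line.
    for line in conllu_text.splitlines() + [""]:
        line = line.strip()
        if not line:
            if has_content:
                n_sentences += 1
                key_counts.update(keys)
            keys, has_content = set(), False
            continue
        has_content = True
        if line.startswith("#") and "=" in line:
            keys.add(line[1:].split("=", 1)[0].strip())
    return n_sentences, key_counts


# Invented sample; the `text_en` key and the annotation are assumptions.
sample = """# sent_id = demo-1
# text = Demat
# text_en = Hello
1\tDemat\tdemat\tINTJ\t_\t_\t0\troot\t_\t_

# sent_id = demo-2
# text = Trugarez
1\tTrugarez\ttrugarez\tNOUN\t_\t_\t0\troot\t_\t_
"""

n_sentences, key_counts = count_sentence_comments(sample)
print(n_sentences, dict(key_counts))
```

To run this on a real file, read it with `open(path, encoding="utf-8").read()` and pass the result to `count_sentence_comments`; the returned counter shows which comment keys are present and in how many sentences.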
- 2018-11-15 v1.0
  - First release in UD
We would like to thank Fulup Jakez and other members of Ofis Publik ar Brezhoneg for constant help and support over the years.
If you use this treebank in your work, please cite:
@inproceedings{tyers:2018a,
author = {Francis M. Tyers and Vinit Ravishankar},
title = {A prototype dependency treebank for Breton},
booktitle = {Actes de la 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN)},
note = {\emph{to appear}},
year = 2018
}
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.2
License: CC BY-SA 4.0
Includes text: yes
Genre: grammar-examples wiki nonfiction fiction news poetry
Lemmas: converted from manual
UPOS: converted from manual
XPOS: manual native
Features: converted from manual
Relations: manual native
Contributors: Tyers, Francis
Contributing: elsewhere
Contact: ftyers@iu.edu
===============================================================================