Skip to content

Latest commit



53 lines (41 loc) · 1.85 KB

File metadata and controls

53 lines (41 loc) · 1.85 KB


UD Breton-KEB is a treebank of Breton that has been manually annotated according to the Universal Dependencies guidelines. The tokenisation guidelines and morphological annotation comes from a finite-state morphological analyser of Breton released as part of the Apertium project.


The text in the corpus comes from a variety of sources: Grammar book sentences, sentences from the Breton Wikipedia, administrative texts from Ofis Publik ar Brezhoneg and short prose texts from Bremaik. In addition there are two traditional songs.


Out of 888 sentences, 256 have translations in French, 275 have translations in English and 347 have translations which have been machine translated from Breton to French using Apertium


  • 2018-11-15 v1.0
    • First release in UD


We would like to thank Fulup Jakez and other members of Ofis Publik ar Brezhoneg for constant help and support over the years.

If you use this treebank in your work, please cite:

    author = {Francis M. Tyers and Vinit Ravishankar},
    title = {A prototype dependency treebank for Breton},
    booktitle = {Actes de la 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN)},
    note = {\emph{to appear}},
    year = 2018
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.2
License: CC BY-SA 4.0
Includes text: yes
Genre: grammar-examples wiki nonfiction fiction news poetry
Lemmas: converted from manual
UPOS: converted from manual
XPOS: manual native
Features: converted from manual
Relations: manual native
Contributors: Tyers, Francis
Contributing: elsewhere