UD_Amharic-ATT is a manually developed treebank for Amharic. Sentences were collected from grammar books, fiction, biographies, religious texts, and news.


UD_Amharic-ATT is a manually annotated treebank. It is annotated for POS tags, morphological information, and dependency relations. Since Amharic is a morphologically rich, pro-drop language with clitic doubling, clitics have been segmented manually.
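
As a rough illustration of how segmented clitics can be read back from a UD file, the sketch below parses CoNLL-U text with the Python standard library only. It assumes that segmented clitics are encoded as standard CoNLL-U multiword tokens (a range line such as `2-3` carrying the surface form, followed by one line per syntactic word); this README does not state the encoding, so check the data files. The function name `read_conllu` and the sample sentence are hypothetical, and the forms `z`, `xy`, `x`, `y` are placeholders rather than Amharic words.

```python
def read_conllu(text):
    """Parse CoNLL-U text into a list of sentences.

    Each sentence is a dict with:
      "words":  syntactic words (one dict per integer-ID line)
      "ranges": multiword tokens as (start, end, surface_form) tuples
    """
    sentences = []
    words, ranges = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if words or ranges:
                sentences.append({"words": words, "ranges": ranges})
                words, ranges = [], []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment, e.g. "# text = ..."
        cols = line.split("\t")
        if len(cols) != 10:
            raise ValueError(f"expected 10 tab-separated columns: {line!r}")
        tok_id = cols[0]
        if "-" in tok_id:
            # Multiword token: surface form spanning several words.
            start, end = tok_id.split("-")
            ranges.append((int(start), int(end), cols[1]))
        elif "." in tok_id:
            continue  # empty node (enhanced dependencies), skipped here
        else:
            words.append({
                "id": int(tok_id),
                "form": cols[1],
                "lemma": cols[2],
                "upos": cols[3],
                "feats": cols[5],
                "head": int(cols[6]),
                "deprel": cols[7],
            })
    if words or ranges:
        sentences.append({"words": words, "ranges": ranges})
    return sentences


# Hypothetical sample with placeholder forms (not real Amharic data):
# the surface token "xy" is split into a verb "x" and a clitic "y".
SAMPLE = "\n".join([
    "# text = z xy",
    "1\tz\tz\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2-3\txy\t_\t_\t_\t_\t_\t_\t_\t_",
    "2\tx\tx\tVERB\t_\t_\t0\troot\t_\t_",
    "3\ty\ty\tPRON\t_\t_\t2\tobj\t_\t_",
    "",
])

sentences = read_conllu(SAMPLE)
for start, end, surface in sentences[0]["ranges"]:
    parts = [w["form"] for w in sentences[0]["words"] if start <= w["id"] <= end]
    print(surface, "->", parts)
```

Keeping the range lines separate from the syntactic words means the surface text can be rebuilt while the dependency relations still refer to the segmented units.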


The treebank was developed by Binyam Ephrem, Gashaw Arutie, and Tsegay Woldemariam. The syntactic annotation was checked and corrected manually by Binyam Ephrem.


  • Binyam Ephrem Seyoum, Yusuke Miyao, and Baye Yimam Mekonnen. 2018. Universal Dependencies for Amharic. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pp. 2216–2222, Miyazaki, Japan: European Language Resources Association (ELRA).


  • 2018-07-01 v2.2
    • First official release.
Data available since: UD v2.2
License: CC BY-SA 4.0
Includes text: yes
Genre: grammar-examples fiction nonfiction bible news
Lemmas: manual native
UPOS: manual native
XPOS: not available
Features: manual native
Relations: manual native
Contributors: Ephrem, Binyam; Arutie, Gashaw; Woldemariam, Tsegay; Navarro Horñiacek, Juan Ignacio
Contributing: elsewhere