Permalink
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
107 lines (77 sloc) 3.37 KB

Summary

This Universal Dependencies Latin Treebank consists of an automatic conversion of a selection of passages from the Ancient Greek and Latin Dependency Treebank 2.1

Introduction

The current UD treebank derives from texts taken from the Ancient Greek and Latin Dependency Treebank 2.1 available at

The original data have been semi-automatically annotated. More precisely, morphological annotation and lemmatization have been performed with the help of the Morpheus morphological analyzer, while syntactic annotation has been done manually. The following guidelines have been followed:

Further details can be found at:

This UD release contains parts of the following works:

author work
Augustus Res Gestae
Cicero In Catilinam
Jerome Vulgata
Vergil Aeneid
Ovid Metamorphoses
Petronius Satyricon
Phaerus Fabulae
Propertius Elegies
Sallust Bellum Catilinae
Suetonius Life of Augustus
Tacitus Historiae

Acknowledgement

The current UD data have been converted by Giuseppe G. A. Celano.

The Ancient Greek and Latin treebank is a result of a joint effort between Tufts University and Leipzig University (DH) under the supervision of Prof. Gregory Crane. Current editors of the treebank are Giuseppe G. A. Celano, Gregory Crane, and Bridget Almas.

Authors of the annotations are (in alphabetical order):

Giuseppe G. A. Celano, J. F. Gentile, Robert Gorman, Vanessa Gorman, Jordan Hawkesworth, Yoana Ivanova, Tovah Keynton, Florin Leonte, Alex Lessie, Daniel Lim Libatique, Meg Luthin, Francesco Mambrini, George Matthews, Jack Mitchell, Molly Miller, Jessica Nord, Sean Stewart, Anthony D. Yates, Polina Yordanova, and Sam Zukoff.

Further details can be found at:

Basic statistics

Tree count: 2273 Word count: 29138 Token count: 29138 Dep. relations: 24 of which 0 language specific POS tags: 12 Category=value feature pairs: 34

References:

Bamman, David and Gregory Crane. 2011. The Ancient Greek and Latin Dependency Treebanks. 2011. In Caroline Sporleder, Antal van den Bosch, Kalliopi Zervanou (eds.), Language Technology for Cultural Heritage, 79-98.

Celano, Giuseppe G. A., Gregory Crane, and Bridget Almas. 2014. The Ancient Greek and Latin Dependency treebank 2.0. https://github.com/PerseusDL/treebank_data

Changelog

  • 2018-04-15 v2.2

    • Repository renamed from UD_Latin to UD_Latin-Perseus.
    • Numerical suffixes of senses removed from lemmas and stored as LId attributes in the MISC column.
  • Since UD v1.2 new texts have been added. More information on the changes in the official github repository (see link above).

=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v1.2
License: CC BY-NC-SA 2.5
Includes text: yes
Genre: fiction nonfiction bible
Lemmas: converted from manual
UPOS: converted from manual
XPOS: manual native
Features: converted from manual
Relations: converted from manual
Contributors: Celano, Giuseppe G. A.; Zeman, Daniel
Contributing: elsewhere
Contact: celano@informatik.uni-leipzig.de
===============================================================================