Summary

An Ottoman Turkish dependency treebank annotated in UD style. Created by Enes Yılandiloğlu.

Introduction

This treebank comprises 85 sentences that were first automatically annotated with MaChAmp (van der Goot et al., 2021), using a model trained on multiple modern Turkish UD treebanks, and then manually corrected in a systematic way. The randomly shuffled sentences were written between the 14th and 20th centuries in various genres such as fiction, news, articles, registry records, and religious sermons. Unfortunately, in this version, the genres cannot be told apart by sentence IDs. The sentences are ordered chronologically rather than by genre, with the earliest written sentence at the top. The treebank uses the Ottoman Turkish transcription alphabet.
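Since the sentences are stored in chronological order and identified only by sentence ID, it can be handy to read them in file order. The following is a minimal sketch, assuming the data is distributed in the standard CoNLL-U format used by UD releases. The function name `read_conllu` and the `SAMPLE` text are illustrative placeholders, not content from this treebank.

```python
from typing import Dict, List


def read_conllu(text: str) -> List[Dict[str, object]]:
    """Split CoNLL-U text into sentences, keeping comments and token rows."""
    sentences: List[Dict[str, object]] = []
    meta: Dict[str, str] = {}
    tokens: List[List[str]] = []
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line:
            # A blank line closes the current sentence block.
            if tokens:
                sentences.append({"meta": meta, "tokens": tokens})
            meta, tokens = {}, []
        elif line.startswith("#"):
            # Comment lines such as "# sent_id = ..." carry sentence metadata.
            key, sep, value = line[1:].partition("=")
            if sep:
                meta[key.strip()] = value.strip()
        else:
            # Token rows hold ten tab-separated CoNLL-U columns.
            tokens.append(line.split("\t"))
    if tokens:
        sentences.append({"meta": meta, "tokens": tokens})
    return sentences


# Placeholder data (hypothetical, NOT taken from the treebank).
SAMPLE = (
    "# sent_id = example-1\n"
    "# text = placeholder one\n"
    "1\tplaceholder\tplaceholder\tNOUN\t_\t_\t2\tnmod\t_\t_\n"
    "2\tone\tone\tNUM\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "# sent_id = example-2\n"
    "# text = placeholder\n"
    "1\tplaceholder\tplaceholder\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
)

sentences = read_conllu(SAMPLE)
sentence_ids = [s["meta"]["sent_id"] for s in sentences]
print(len(sentences), sentence_ids)
```

With the real file, the same function could be applied to the file contents read with `open(path, encoding="utf-8").read()`. The list then follows the file's chronological order, earliest sentence first.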

Acknowledgments

I am immensely grateful to Fatma Elcan for her tremendous help in providing me with sentences.

Changelog

  • 2024-05-15 v2.14
    • Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.14
License: CC BY-SA 4.0
Includes text: yes
Genre: news fiction nonfiction bible government
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Yılandiloğlu, Enes
Contributing: here
Contact: enes.yilandiloglu@helsinki.fi
===============================================================================