Skip to content

A coreference corpus of Conchucos Quechua (ISO 639-3 code qxo).

Notifications You must be signed in to change notification settings

elizabethpankratz/qxoRef

Repository files navigation

qxoRef

This repository contains the qxoRef coreference corpus for Conchucos Quechua (ISO 639-3 code qxo). The corpus consists of twelve stories told by native Quechua speakers in Huari, Peru, in 2015. These stories have been transcribed and extended with morphological analysis and mention annotation (including singletons). All files are provided in the CoNLL format, following the CoNLL-U word segmentation guidelines.

qxoRef is available under a CC-BY-NC-SA 4.0 license.

To cite

Pankratz, E. (2021). qxoRef 1.0: A coreference corpus and mention-pair baseline for coreference resolution in Conchucos Quechua. Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP-NAACL 2021), 1–9. https://doi.org/10.18653/v1/2021.americasnlp-1.1

The original data

The transcriptions and morphological analyses included in qxoRef come from the DFG-funded project 274614727 (PI Uli Reich; project website incl. documentation here). The original data from this project is available under a CC-BY-NC-SA 4.0 license at the Freie Universität Berlin's Refubium. All data in qxoRef comes from the "cuento" subtask.

About

A coreference corpus of Conchucos Quechua (ISO 639-3 code qxo).

Resources

Stars

Watchers

Forks

Packages

No packages published