Skip to content

heeringa0/UD_Frisian-Frysk

Repository files navigation

# Summary

The UD Frisian-FA-RuG treebank is a West Frisian treebank. 

# Introduction

This corpus contains sentences from the multiple genres: news, science, novels,
culture, Wikipedia. The lemmas and PoS tags are manually verified. Morphologic
and syntactic annotations were generated on the basis of a literal Dutch
translation using an annotator that was trained on the basis of the UD Dutch
LassySmall 2.5 treebank.

# Acknowledgements

Martha Hofman verified the lemmas and POS tags of the larger part of the corpus.

# References

Heeringa, W., Bouma, G., Hofman, M., Drenth, E., Wijffels, J., & Van de Velde,
H. (2021). POS tagging, lemmatization and dependency parsing of West Frisian.
arXiv preprint arXiv:2107.07974.


=== Machine-readable metadata =================================================
Data available since: UD v1.0
License: CC BY-SA 4.0
Includes text: yes
Genre: news, science, novels, culture, Wikipedia
Lemmas: manually verified
UPOS: manually verified
XPOS: generated via Dutch
Features: generated via Dutch
Relations: generated via Dutch
Contributing: elsewhere
Contributors: Heeringa, Wilbert; Bouma, Gosse; Van de Velde, Hans
Contact: wheeringa@fryske-akademy.nl
===============================================================================

About

UD treebank of Frisian

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published