Universal Derivations

Introduction

Universal Derivations (UDer) is a collection of harmonised lexical networks that capture word-formation, especially derivation, in a cross-linguistically consistent annotation scheme for many languages. The annotation scheme is based on a rooted tree data structure (as used in the DeriNet 2.0 database), in which nodes correspond to lexemes and edges represent relations between lexemes (derivation, compounding, conversion, spelling variants, etc.). For more details, see the Universal Derivations webpage.
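The rooted-tree scheme described above can be illustrated with a minimal Python sketch. This is not the DeriNet API and not the UDer file format: the `Lexeme` class, its fields, and the English example words are all hypothetical, chosen only to show how lexemes (nodes) and word-formation relations (edges) fit into a tree in which each lexeme has at most one parent.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


# Hypothetical illustration only; not the DeriNet API or the UDer data format.
@dataclass(eq=False)  # compare nodes by identity, not by (recursive) content
class Lexeme:
    lemma: str
    pos: str
    # Relation linking this lexeme to its parent, e.g. "derivation".
    relation: Optional[str] = None
    # Excluded from repr to avoid infinite recursion through the tree.
    parent: Optional["Lexeme"] = field(default=None, repr=False)
    children: List["Lexeme"] = field(default_factory=list, repr=False)

    def add_child(self, child: "Lexeme", relation: str) -> "Lexeme":
        """Attach `child` under this lexeme; a tree node has a single parent."""
        if child.parent is not None:
            raise ValueError(f"{child.lemma!r} already has a parent")
        child.parent = self
        child.relation = relation
        self.children.append(child)
        return child

    def root(self) -> "Lexeme":
        """Follow parent links up to the root of the tree."""
        node = self
        while node.parent is not None:
            node = node.parent
        return node

    def subtree(self) -> Iterator["Lexeme"]:
        """Yield this lexeme and all its descendants in pre-order."""
        yield self
        for child in self.children:
            yield from child.subtree()


# Illustrative word family (made up for this sketch, not taken from UDer).
teach = Lexeme("teach", "VERB")
teacher = teach.add_child(Lexeme("teacher", "NOUN"), "derivation")
teachable = teach.add_child(Lexeme("teachable", "ADJ"), "derivation")
unteachable = teachable.add_child(Lexeme("unteachable", "ADJ"), "derivation")

family = [node.lemma for node in teach.subtree()]
```

In this sketch, `unteachable.root()` returns the `teach` node, and `family` lists the whole word family starting from the root. Storing the relation on the child node works because each lexeme has exactly one incoming edge in a tree.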

Related tools

Each individual resource in the UDer collection can be:

For the DeriNet API, there are Jupyter Notebook manuals with:

Authors

Lukáš Kyjánek, Zdeněk Žabokrtský, Jonáš Vidra, Magda Ševčíková. The original authors of the individual data resources harmonised in this collection are listed in the README.md file of each resource.

How to cite Universal Derivations

If you use the collection in your research or need to cite it for any reason, please cite:

@MISC{UDer-data,
 title      = {Universal Derivations v1.1},
 author     = {Kyj{\'a}nek, Luk{\'a}{\v s} and {\v Z}abokrtsk{\'y}, Zden{\v e}k and Vidra, Jon{\'a}{\v s} and {\v S}ev{\v c}{\'{\i}}kov{\'a}, Magda},
 url        = {http://hdl.handle.net/11234/1-3247},
 note       = {{LINDAT}/{CLARIAH}-{CZ} digital library at the Institute of Formal and Applied Linguistics ({{\'U}FAL}), Faculty of Mathematics and Physics, Charles University},
 copyright  = {Universal Derivations v1.0 License Agreement},
 year       = {2021}
}

References to the individual data resources included in this collection are listed in the README.md file of each resource. If you use the individual resources, please cite them too.

Acknowledgment

We would like to thank all researchers who made their derivational resources publicly available under open licenses. This work was supported by:

  • the Grant No. GA19-14534S of the Czech Science Foundation,
  • the Grant No. START/HUM/010 of Grant schemes at Charles University (reg. No. CZ.02.2.69/0.0/0.0/19_073/0016935),
  • the Grant No. 1176219 of the Charles University Grant Agency,
  • the Grant No. SVV260575 of Charles University,
  • the LINDAT/CLARIAH-CZ project (No. LM2015071, No. LM2018101) of the Ministry of Education, Youth and Sports of the Czech Republic.

About

Scripts for compiling the Universal Derivations collection of harmonised word-formation resources for multiple languages.
