Skip to content

CLDF dataset derived from Joo's "Phonosemantic Biases" from 2019

License

Notifications You must be signed in to change notification settings

lexibank/joophonosemantic

Repository files navigation

CLDF dataset derived from Joo's "Phonosemantic Biases" from 2020

CLDF validation

How to cite

If you use these data please cite

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at https://github.com/ianjoo/LJ-List/

Conceptlists in Concepticon:

Notes

The LJ list of Bukiyip was not disclosed due to the request from the contributor.

Statistics

CLDF validation Glottolog: 98% Concepticon: 100% Source: 93% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 65 (linked to 64 different Glottocodes)
  • Concepts: 100 (linked to 100 different Concepticon concept sets)
  • Lexemes: 6,171
  • Sources: 111
  • Synonymy: 1.00
  • Invalid lexemes: 0
  • Tokens: 25,044
  • Segments: 324 (0 BIPA errors, 0 CLTS sound class errors, 324 CLTS modified)
  • Inventory size (avg): 30.18

Possible Improvements:

  • Entries missing sources: 450/6171 (7.29%)

Contributors

Name GitHub user Description Role
Ian Joo @ianjoo author
Johann-Mattis List @LinguList cldf conversion Editor
Christoph Rzymski @chrzyki cldf conversion Editor

CLDF Datasets

The following CLDF datasets are available in cldf: