Skip to content

SequenceComparison/listsamplesize

Repository files navigation

CLDF Dataset derived from List's "Sample Size and Cognate Detection" from 2014

CLDF validation

How to cite

If you use these data please cite

Description

This dataset is licensed under a CC-By-4.0 license

Available online at https://github.com/SequenceComparison/SupplementaryMaterial/zipball/master

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 4 (linked to 4 different Glottocodes)
  • Concepts: 550 (linked to 549 different Concepticon concept sets)
  • Lexemes: 2,429
  • Sources: 1
  • Synonymy: 1.10
  • Cognacy: 2,429 cognates in 1,598 cognate sets (1,077 singletons)
  • Cognate Diversity: 0.56
  • Invalid lexemes: 0
  • Tokens: 11,290
  • Segments: 82 (0 BIPA errors, 0 CLTS sound class errors, 82 CLTS modified)
  • Inventory size (avg): 41.75

Contributors

Name GitHub user Description Role
Johann-Mattis List @LinguList maintainer Author, Editor

CLDF Datasets

The following CLDF datasets are available in cldf:

About

CLDF dataset derived from List's "Sample Size in Cognate Detection" from 2014

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •