tupian-language-resources/bororo

LeDaBo (Bororoan)

The Lexical Database of Bororoan comprises lexical data from 10 doculects, including the known varieties of Bororoan and languages known to be genetically related to it. The database includes manually assigned simple and partial cognates, colexifications, and notes. It follows the standardized CLDF format, which makes the data easy to share and reuse. The concepts covered include the Swadesh list, culturally relevant items, and species of fauna and flora.

The data for modern Bororo stems from the author's own fieldwork.

The transcriptions for modern Bororo will be improved in the next version.

Bororo shows dialectal variation; the transcriptions of modern Bororo in the database refer to the variety spoken in the Indigenous Land Meruri.

How to cite

If you use these data, please cite this dataset using the DOI of the particular released version you used.

Statistics

Glottolog: 62%, Concepticon: 87%, Source: 100%, BIPA: 100%, CLTS SoundClass: 100%

  • Varieties: 10
  • Concepts: 341
  • Lexemes: 1,228
  • Sources: 12
  • Synonymy: 1.16
  • Cognacy: 1,418 cognates in 661 cognate sets (375 singletons)
  • Cognate Diversity: 0.36
  • Invalid lexemes: 0
  • Tokens: 6,138
  • Segments: 86 (0 BIPA errors, 0 CLTS sound class errors, 86 CLTS modified)
  • Inventory size (avg): 35.00
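The basic counts above (varieties, concepts, lexemes) can in principle be re-derived from the CLDF forms table. The following is a minimal sketch, not the script used to generate these statistics. It assumes that the forms are stored in a file such as `cldf/forms.csv` (a hypothetical path, since the file list is not shown here) and that the table uses the default CLDF column names `Language_ID` and `Parameter_ID`. The synonymy value is computed as the average number of lexemes per attested variety-concept pair, which is one common definition and an assumption about how the figure above was obtained.

```python
import csv
from collections import defaultdict


def wordlist_stats(forms_path):
    """Summarize a CLDF forms table (assumed default column names)."""
    # Map each attested (variety, concept) pair to its number of lexemes.
    per_slot = defaultdict(int)
    with open(forms_path, newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            per_slot[(row["Language_ID"], row["Parameter_ID"])] += 1

    lexemes = sum(per_slot.values())
    return {
        "varieties": len({language for language, _ in per_slot}),
        "concepts": len({concept for _, concept in per_slot}),
        "lexemes": lexemes,
        # Assumed definition: lexemes per attested (variety, concept) pair.
        "synonymy": lexemes / len(per_slot) if per_slot else 0.0,
    }


# Usage (the path is an assumption about the repository layout):
# print(wordlist_stats("cldf/forms.csv"))
```

Only the standard library is used, so the sketch runs without installing any CLDF tooling; a dedicated CLDF reader would be the better choice for anything beyond simple counts.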

Contributors

Name | GitHub user | Description | Role
Fabrício Ferraz Gerardi | @LanguageStructure | Data collector, cognacy assignment, colexifications, notes | Author

CLDF Datasets

The following CLDF datasets are available in cldf: