LeDaBo (Bororoan)
The Lexical Database of Bororoan comprises lexical data from 10 doculects, including various known forms of Bororoan and languages that are known to be genetically related. The database includes manually assigned simple and partial cognates, colexifications, and notes. It follows the standardized CLDF format, which makes the data easy to share and reuse. The concepts covered include the Swadesh List, culturally relevant items, and species of fauna and flora.
The data for modern Bororo stems from the author's own fieldwork.
The transcriptions for modern Bororo will improve in the next version.
Bororo shows dialectal variation; the transcriptions for modern Bororo in this database refer to the variety spoken in the Indigenous Land Meruri.
If you use these data, please cite this dataset using the DOI of the specific released version you used.
- Varieties: 10
- Concepts: 341
- Lexemes: 1,228
- Sources: 12
- Synonymy: 1.16
- Cognacy: 1,418 cognates in 661 cognate sets (375 singletons)
- Cognate Diversity: 0.36
- Invalid lexemes: 0
- Tokens: 6,138
- Segments: 86 (0 BIPA errors, 0 CLTS sound class errors, 86 CLTS modified)
- Inventory size (avg): 35.00
Name | GitHub user | Description | Role |
---|---|---|---|
Fabrício Ferraz Gerardi | @LanguageStructure | Data Collector, cognacy assignment, colexifications, notes | Author |
The following CLDF datasets are available in the cldf directory:
- CLDF Wordlist at cldf/cldf-metadata.json
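
To see which data files the Wordlist consists of, the metadata file can be inspected with the Python standard library alone. The following is a minimal sketch, not part of the dataset: the function name `list_cldf_tables` is hypothetical, and it assumes the metadata follows the usual CLDF layout, with a top-level `dc:conformsTo` entry and a `tables` list whose entries each carry a `url`. For full access to the data (typed columns, foreign keys), the `pycldf` package is likely the better choice.

```python
import json
from pathlib import Path


def list_cldf_tables(metadata_path):
    """Return (module, {component: csv_path}) for a CLDF metadata file.

    Hypothetical helper for illustration; assumes the usual CLDF layout.
    """
    metadata_path = Path(metadata_path)
    with metadata_path.open(encoding="utf-8") as fh:
        metadata = json.load(fh)

    def term(uri):
        # "http://cldf.clld.org/v1.0/terms.rdf#FormTable" -> "FormTable"
        return uri.rsplit("#", 1)[-1] if uri else None

    module = term(metadata.get("dc:conformsTo"))
    tables = {}
    for table in metadata.get("tables", []):
        # Tables without a CLDF component type are keyed by their file name.
        name = term(table.get("dc:conformsTo")) or table["url"]
        # Table URLs are resolved relative to the metadata file.
        tables[name] = metadata_path.parent / table["url"]
    return module, tables


if __name__ == "__main__":
    path = Path("cldf/cldf-metadata.json")
    if path.exists():
        module, tables = list_cldf_tables(path)
        print(module)
        for name, csv_path in tables.items():
            print(f"{name}: {csv_path}")
```

Run from the repository root, this prints the module name followed by one line per table; the listed CSV files can then be opened with any CSV reader.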