Add Cuman to languages #13

martino-vic · 2022-10-12T14:38:59Z

No description provided.

martino-vic · 2022-10-12T19:53:42Z

This throws following error:

Traceback (most recent call last):
File "/home/viktor/Documents/cldfvenv3.9/bin/cldfbench", line 8, in
sys.exit(main())
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/main.py", line 84, in main
return args.main(args) or 0
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/pylexibank/commands/makecldf.py", line 24, in run
with_dataset(args, 'makecldf', dataset=dataset)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/cli_util.py", line 153, in with_dataset
res = func(*arg, args)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/pylexibank/dataset.py", line 218, in _cmd_makecldf
super()._cmd_makecldf(args)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/dataset.py", line 206, in _cmd_makecldf
self.cmd_makecldf(args)
File "/home/viktor/Documents/GitHub/rtbwestoldturkic/lexibank_rtbwestoldturkic.py", line 154, in cmd_makecldf
lex["ProsodicStructure"] = prosodic_string(lex["Segments"], _output='cv')
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/lingpy/sequence/sound_classes.py", line 881, in prosodic_string
[int(t) for t in tokens2class(string, rcParams['art'],
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/lingpy/sequence/sound_classes.py", line 792, in tokens2class
raise ValueError("[!] your sequence contains only unknown characters")
ValueError: [!] your sequence contains only unknown characters

martino-vic · 2022-10-12T19:57:14Z

Actually: Don't add Cuman to languages. It's enough when it's covered in the raw file but is ignored by the lexibank-script. Since in my own research I need only WOT, not Cuman. And because there are only a handful of Cuman words, with which one can't really solve any quantitative tasks anyways. Besides, the original dictionary contains heaps and heaps of other languages, that are currently ignored here as well. But for this I have opened #10 already

Minor changes like timestamp remain though. The entry in languages.csv would have been: Cum Cuman Eurasia 46.91 19.66 cuma1241 qwm

martino-vic closed this as completed Oct 12, 2022

martino-vic added a commit that referenced this issue Apr 5, 2023

did and undid #13

c112f66

Minor changes like timestamp remain though. The entry in languages.csv would have been: Cum Cuman Eurasia 46.91 19.66 cuma1241 qwm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Cuman to languages #13

Add Cuman to languages #13

martino-vic commented Oct 12, 2022

martino-vic commented Oct 12, 2022

martino-vic commented Oct 12, 2022

Add Cuman to languages #13

Add Cuman to languages #13

Comments

martino-vic commented Oct 12, 2022

martino-vic commented Oct 12, 2022

martino-vic commented Oct 12, 2022