Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Cuman to languages #13

Closed
martino-vic opened this issue Oct 12, 2022 · 2 comments
Closed

Add Cuman to languages #13

martino-vic opened this issue Oct 12, 2022 · 2 comments

Comments

@martino-vic
Copy link
Collaborator

No description provided.

@martino-vic
Copy link
Collaborator Author

This throws following error:

Traceback (most recent call last):
File "/home/viktor/Documents/cldfvenv3.9/bin/cldfbench", line 8, in
sys.exit(main())
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/main.py", line 84, in main
return args.main(args) or 0
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/pylexibank/commands/makecldf.py", line 24, in run
with_dataset(args, 'makecldf', dataset=dataset)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/cli_util.py", line 153, in with_dataset
res = func(*arg, args)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/pylexibank/dataset.py", line 218, in _cmd_makecldf
super()._cmd_makecldf(args)
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/cldfbench/dataset.py", line 206, in _cmd_makecldf
self.cmd_makecldf(args)
File "/home/viktor/Documents/GitHub/rtbwestoldturkic/lexibank_rtbwestoldturkic.py", line 154, in cmd_makecldf
lex["ProsodicStructure"] = prosodic_string(lex["Segments"], _output='cv')
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/lingpy/sequence/sound_classes.py", line 881, in prosodic_string
[int(t) for t in tokens2class(string, rcParams['art'],
File "/home/viktor/Documents/cldfvenv3.9/lib/python3.9/site-packages/lingpy/sequence/sound_classes.py", line 792, in tokens2class
raise ValueError("[!] your sequence contains only unknown characters")
ValueError: [!] your sequence contains only unknown characters

@martino-vic
Copy link
Collaborator Author

Actually: Don't add Cuman to languages. It's enough when it's covered in the raw file but is ignored by the lexibank-script. Since in my own research I need only WOT, not Cuman. And because there are only a handful of Cuman words, with which one can't really solve any quantitative tasks anyways. Besides, the original dictionary contains heaps and heaps of other languages, that are currently ignored here as well. But for this I have opened #10 already

martino-vic added a commit that referenced this issue Apr 5, 2023
Minor changes like timestamp remain though. The entry in languages.csv would have been:
Cum	Cuman	Eurasia	46.91	19.66	cuma1241	qwm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant