Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cognate Sets Need Review #37

Open
LinguList opened this issue Jul 11, 2021 · 10 comments
Open

Cognate Sets Need Review #37

LinguList opened this issue Jul 11, 2021 · 10 comments

Comments

@LinguList
Copy link
Collaborator

I just inspected a couple of cognate sets in the data out of interest and found that there are often cases where I think one should make a more fine-grained distinction rather than lumping entries into one cognate set.

Specifically clear is this when looking at the alignments, even if one does not really align the data.

Bildschirmfoto_2021-07-11_18-12-49

@LinguList
Copy link
Collaborator Author

In this case, it looks very implausible to assume that k corresponds to m which almost never really occurs. Moreover, the form in Karo suffers from a misstokenization, as is obviously one sound.

@LinguList
Copy link
Collaborator Author

We can pull out implausible sound correspondences, if you want to, and it seems useful to do so, as this would probably also enhance the quality of the data. By now, quick eyeballing reveals numerous cases where it is hard to defend that the words are cognate, unless you provide me with some really deep insights into morphological patterns.

However: morphological patterns should not be aligned anyway, so if you know that kam is similar to nam for some morphological process, they would still be two different cognate sets, and you'd add one root identifiers (ROOTID) that unifies them.

@LanguageStructure
Copy link
Collaborator

LanguageStructure commented Jul 11, 2021 via email

@LanguageStructure
Copy link
Collaborator

LanguageStructure commented Jul 11, 2021 via email

@LanguageStructure
Copy link
Collaborator

@LinguList cognac on edictor has been improved compared to the cognates you see in the published version of TuLeD. Besides improved cognacy, we also have more concepts, languages, more partial cognates

@LinguList
Copy link
Collaborator Author

LinguList commented Jul 11, 2021 via email

@LinguList
Copy link
Collaborator Author

LinguList commented Jul 11, 2021 via email

@LanguageStructure
Copy link
Collaborator

LanguageStructure commented Jul 11, 2021 via email

@LinguList
Copy link
Collaborator Author

LinguList commented Jul 11, 2021 via email

@LanguageStructure
Copy link
Collaborator

LanguageStructure commented Jul 11, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants