correct col "loans" #5

Closed
martino-vic opened this issue Apr 25, 2022 · 1 comment

Comments

@martino-vic
Collaborator

The "loans" column should only be "true" for recipient words (i.e. all languages except WOT, since that's the donor language).
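A minimal sketch of that rule, assuming the rows are loaded as a list of dicts with hypothetical `language` and `loans` keys (the real column names in the dataset may differ, and the forms are placeholders):

```python
# Sketch only: "language", "form" and "loans" are hypothetical column names.
DONOR_LANGUAGE = "WOT"  # the donor language named in this issue


def set_loan_flags(rows):
    """Return copies of the rows with "loans" set to "true" only for recipient words."""
    return [
        {**row, "loans": "false" if row["language"] == DONOR_LANGUAGE else "true"}
        for row in rows
    ]


# Placeholder rows, not taken from the real data.
rows = [
    {"language": "H", "form": "placeholder-1", "loans": "true"},
    {"language": "WOT", "form": "placeholder-2", "loans": "true"},
]
fixed = set_loan_flags(rows)
```

This treats every non-WOT row as a recipient word, so any exception would still need a manual check.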

@LinguList
Collaborator

Yep, "loans" is very specific; you can add another, more extensive notation if you want. If it is not clear how to do that, let me know.

martino-vic pushed a commit that referenced this issue Apr 5, 2023
many work hours put into manual cleaning

Fixed #7 with a script + manual check
Fixed #5 manually. H, OH, LAH, EAH are always loans. Where WOT is a loan it's almost always from Mongolian

Checked the corrigenda and addenda section at the end of the PDF and implemented it. Still need to write those changes to the comment field, so that I won't mistake them for errors later on

Created a new column "Cum" for Cumanian, since some of the words from that language were wrongly in col WOT

Corrected column "Orthography" so that only the entries of modern Hungarian get a transcription

Manually checked entries that occurred multiple times. In many cases redundant rows could be deleted.

Corrected the Orthography in the raw file in some cases

Improved the ipa-transcription file hun-wot.csv for epitran

deleted some redundant characters

Filled in missing entries (some weren't captured by the parsing script, for various reasons)
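
A manual check of entries that occur multiple times could start from a listing like the one below. This is only a sketch, not the script used for this commit: the `form` key is a hypothetical column name and the rows are placeholders. It only reports candidates and deletes nothing, since deciding whether a row is redundant was done by hand.

```python
from collections import defaultdict


def find_repeated_entries(rows, key="form"):
    """Map each value of `key` that occurs more than once to its row indices."""
    groups = defaultdict(list)
    for index, row in enumerate(rows):
        groups[row[key]].append(index)
    # Keep only values seen in more than one row, for manual review.
    return {value: indices for value, indices in groups.items() if len(indices) > 1}


# Placeholder rows, not taken from the real data.
rows = [
    {"form": "placeholder-1"},
    {"form": "placeholder-2"},
    {"form": "placeholder-1"},
]
repeated = find_repeated_entries(rows)
```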