You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
many work hours put into manual cleaning
Fixed#7 with a script + manual check
Fixed#5 manually. H, OH, LAH, EAH are always loans. Where WOT is a loan it's almost always from Mongolian
checked the corrigenda and addenda part in the end of the pdf and implemented that. Still need to write those changes to the comment field, so I won't think it's wrong, later on
Created a new column "Cum" for Cumanian, since some of the words from that langauge were wrongly in col WOT
Corrected column "Orthography" so that only the entries of modern Hungarian get a transcription
Manually checked entries that occured multiple times. In many cases redundant rows could be deleted.
Corrected the Orthography in the raw file in some cases
Improved the ipa-transciption file hun-wot.csv for epitran
deleted some redundant characters
Filled in missing entries (some weren't captured by the parsing script, for various reasons)
should only be "true" for recipient words (i.e. all lgs except WOT, since that's the donor lg)
The text was updated successfully, but these errors were encountered: