асфальтны is not analyzed correctly #20
Comments
The same thing here:
So we should definitely generate асфальтны, but should we analyse both forms? That is, is асфальтне attested commonly enough? (Btw, the dictionary link doesn't show any relevant information when I click on it.)
Also, can you confirm how nouns that end in ль behave, like роль, руль, автомобиль? What about words that end in бль, like рубль, ансамбль, etc.?
Of course, some people may write "асфальтне", but that is a spelling mistake. If we analyze both forms, then it will also affect Apertium's spellchecker. That said, the spellchecker already doesn't work as expected because of the many archaic and dialectal words in the dictionary, which is why I think we should add some 'Orth' tag for "good" words in the dictionary so that the spellchecker uses only them... Maybe here we should analyze both forms but add an additional tag marking the form as orthographically incorrect. If I remember correctly, @IlnarSelimcan has already used one a couple of times...
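To make the suggestion above concrete, here is a minimal Python sketch of "analyse both forms, but mark the non-standard one". It is only an illustration: the tag name `<err_orth>`, the toy lookup table, and the helper functions are all hypothetical and are not taken from apertium-tat's lexicon or from Apertium's spellchecker.

```python
from dataclasses import dataclass

# Hypothetical tag name, for illustration only; not confirmed as apertium-tat's tag.
ERR_ORTH = "<err_orth>"


@dataclass(frozen=True)
class Analysis:
    lemma: str
    tags: tuple


# Toy stand-in for the analyser: surface form -> list of analyses.
# Both forms are recognised, but only the non-standard one carries ERR_ORTH.
ANALYSES = {
    "асфальтны": [Analysis("асфальт", ("<n>", "<acc>"))],
    "асфальтне": [Analysis("асфальт", ("<n>", "<acc>", ERR_ORTH))],
}


def analyse(surface):
    """Return every analysis for a surface form, including non-standard ones."""
    return ANALYSES.get(surface, [])


def is_spelled_correctly(surface):
    """Accept a form only if at least one analysis lacks the error tag."""
    return any(ERR_ORTH not in a.tags for a in analyse(surface))


if __name__ == "__main__":
    for form in ("асфальтны", "асфальтне"):
        print(form, len(analyse(form)), is_spelled_correctly(form))
```

With this split, the analyser still recognises "асфальтне" in running text, while a spellchecker built on `is_spelled_correctly` would reject it.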
Most of them have affixes with front vowels, but there might be exceptions. For example, correct ones:
And some more:
Do Russian words ending in ‹е› generally take back vowel endings? That is, is this part of a larger pattern, or is it an exception?
Related issue: we have the lexicon set up to do both ноябрьдә and ноябрьда. Which is correct?
Also, is it январенда or январендә? Once I got фасоленда working, январендә is now being produced as январенда. I'll hack it to only work with оль words for now, but this will need to be investigated.
Actually, we do the reverse. We add a tag
Have a look at the commit: with knowledge of how the word-class categorisation works, it's pretty simple to do for many words.
"Акрополь" is strange. You can search for that word here:
According to the aforementioned website, the correct form is "ноябрьдә".
It also says that the correct form is "январенда".
"фасоль"
I can't say for certain right now, but I think you are right. All the words that come to mind take endings with back vowels: ришельесы, ательесы, льесы, подпольесы.
According to the Tatar orthographic dictionary, it should be "асфальтны", not "асфальтне":
http://suzlek.antat.ru/words.php?txtW=%D0%B0%D1%81%D1%84%D0%B0%D0%BB%D1%8C%D1%82&submit=%D0%AD%D0%B7%D0%BB%D3%99%D2%AF