You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Evaluated example from readme for kazakh language with no error, but result is wrong. English language works fine.
Expected Result
I expected a list of words with their correct normalized form but for every word normalization form consists of only 1 letter.
Actual Result
[[1 Алтай а NOUN adj Case=Gen 2 nmod:poss _ _,
2 жерінің ж NOUN n _ 3 obl _ _,
3 асты а VERB adj _ 4 nsubj _ _,
4 қандай қ PRON adv _ 5 nsubj _ _,
5 қазыналы қ VERB n _ 0 root _ _,
6 болса б VERB v Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 5 cop _ SpaceAfter=No,
7 . . PUNCT sent _ 5 punct _ _],
[1 Ағаш а VERB v Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 0 root _ SpaceAfter=No,
2 . . PUNCT sent _ 1 punct _ SpaceAfter=No]]
Reproduction Steps
from cube.api import Cube # import the Cube object
cube=Cube(verbose=True) # initialize it
cube.load("kk") # select the desired language (it will auto-download the model on first run)
text="Алтай жерінің асты қандай қазыналы болса. Ағаш."
sentences=cube(text) # call with your own text (string) to obtain the annotations
sentences
System Information
Python version 3.6.12
Operating system Ubuntu 20.04
The text was updated successfully, but these errors were encountered:
Hi @pas-valkov - I'm really sorry for the late response. I'm updating the models for 3.0 right now and hopefully it will fix the issue. Sorry again, I don't know how I missed this issue. I will let you know as soon as it's fixed.
I've just uploaded the updated model. Take into consideration that Kazakh is a really small treebank in UD and the system will not have a high accuracy.
Evaluated example from readme for kazakh language with no error, but result is wrong. English language works fine.
Expected Result
I expected a list of words with their correct normalized form but for every word normalization form consists of only 1 letter.
Actual Result
[[1 Алтай а NOUN adj Case=Gen 2 nmod:poss _ _,
2 жерінің ж NOUN n _ 3 obl _ _,
3 асты а VERB adj _ 4 nsubj _ _,
4 қандай қ PRON adv _ 5 nsubj _ _,
5 қазыналы қ VERB n _ 0 root _ _,
6 болса б VERB v Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 5 cop _ SpaceAfter=No,
7 . . PUNCT sent _ 5 punct _ _],
[1 Ағаш а VERB v Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 0 root _ SpaceAfter=No,
2 . . PUNCT sent _ 1 punct _ SpaceAfter=No]]
Reproduction Steps
from cube.api import Cube # import the Cube object
cube=Cube(verbose=True) # initialize it
cube.load("kk") # select the desired language (it will auto-download the model on first run)
text="Алтай жерінің асты қандай қазыналы болса. Ағаш."
sentences=cube(text) # call with your own text (string) to obtain the annotations
sentences
System Information
The text was updated successfully, but these errors were encountered: