Keeping in mind that your Croatian example is not good Croatian in the first place, the correct sentence would be something like this:
moramo odraditi vrlo kompliciran primjer , rečenicu koja sadrži što više sastojaka i ovisnosti , što je više moguće
Running MSTParser on that corrected sentence gives even more ROOT elements.
We have now analysed what the problem might be, and it seems that this "multiple-roots problem" is linked to the fact that the model was trained on "CoNLL-X" (http://anthology.aclweb.org/W/W06/W06-2920.pdf) tagged sentences (as stated on the model source site http://nlp.ffzg.hr/resources/models/dependency-parsing/). I also checked the source data that the model was trained on, and yes, there are sentences in which several tokens have a HEAD value of 0.
The CoNLL-X documentation says:
HEAD: Head of the current token, which is either a value of ID, or zero (’0’) if the token links to the virtual root node of the sentence. Note that depending on the original treebank annotation, there may be multiple tokens with a HEAD value of zero.
So it seems that you should cover this use case in your unit tests and potentially in other parts of DKPro Core.
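To make the use case concrete, here is a minimal sketch of how one could check a CoNLL-X sentence for multiple roots. It is not DKPro Core or MSTParser code: the function name `root_token_ids` and the three-token sample are invented for illustration, and the sketch only assumes the ten tab-separated columns of the CoNLL-X format, with HEAD as the seventh column.

```python
from typing import List


def root_token_ids(conll_sentence: str) -> List[int]:
    """Return the IDs of all tokens whose HEAD column is 0.

    Assumes one token per line with the ten tab-separated CoNLL-X columns:
    ID FORM LEMMA CPOSTAG POSTAG FEATS HEAD DEPREL PHEAD PDEPREL
    """
    roots = []
    for line in conll_sentence.splitlines():
        if not line.strip():
            continue  # skip blank lines between sentences
        cols = line.split("\t")
        token_id, head = int(cols[0]), int(cols[6])
        if head == 0:
            roots.append(token_id)
    return roots


# Invented toy sentence (placeholder tokens, not real treebank data):
# tokens 1 and 3 both attach to the virtual root node.
SAMPLE = "\n".join([
    "1\tw1\t_\tX\tX\t_\t0\tROOT\t_\t_",
    "2\tw2\t_\tX\tX\t_\t1\tDEP\t_\t_",
    "3\tw3\t_\tX\tX\t_\t0\tROOT\t_\t_",
])

print(root_token_ids(SAMPLE))  # [1, 3]
```

A unit test for this case would then accept a result list longer than one instead of asserting a single root per sentence.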
Source: #619 (comment)