-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is this ready to use? #2
Comments
The model is usable right now and achieves quite good results, we will publish the precise evaluation metrics shortly. You can download it here: http://zil.ipipan.waw.pl/SpacyPL |
That sounds good, can't wait to see the results. Two questions:
|
|
We have some manually annotated data that we will compare the parser against, so I'll report if we find something that seems off. Please notify me (here) once the new model is out. (: |
The new version came out, please see here for details. The structure of the sentence might be a little difficult, but as I've said, all examples should have only one Some of the problems with the parser must be attributed to the way tokenization is handled in the training. The default tokenizer is used (and substituting Morfeusz here would not really be ideal, as it would make reproducible training substantially harder) becasue With respect to Morfeusz, the Please let us know if anything pops up! |
Thanks for the update. So far, our results have been somewhat discouraging, as we get fairly low agreement between our manual annotation and what I'm extracting using the parser, though that is still work in progress. The sentences we use are from wikipedia, so the one cited above is actually on the simpler side of things, maybe the treebank(s) do not reflect that kind of structure. I will have a look at the new version tomorrow, hopes are up in any case! (: |
While we work on this spacy model, you may also take a look at these tools and models, which may give better results at the moment. |
Hi. I am working on Polish and was looking for a parser and just found this. So I was wondering - is this ready to be used? Is there any information on how well the components perform, in particular the dependency parser? Otherwise, could you recommend one of the existing tools? (:
The text was updated successfully, but these errors were encountered: