New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[en] Chunker improvement #7044
Comments
I have made a change locally. The change wasn't difficult, but at least 12 tests break now. These seem to be cases where the chunker is wrong and has always been, but now it causes an error. One example:
"Adults average" is considered a noun phrase, and now it's considered a singular noun phrase, breaking |
Hi Daniel, Yes, I will work on these cases. I will probably need help with setting up a branch. Another type of chunker error is where there are different readings for upper-case text and lower-case text:
|
The work-in-progress branch (which tests that don't work) is now available here: https://github.com/languagetool-org/languagetool/tree/issue-7044-chunker |
@languagetool-org/developers, if @danielnaber is out of the office today, can one of you please look at #7050 ? Many thanks. I think that I have possibly not used GitHub correctly. I am not sure what to do. |
PR has been merged now. |
@danielnaber, thank you. I know about more chunker errors. Do you want them? If yes, I will make a new issue. |
We cannot really fix them, but maybe we can add antipatterns, so please send them. |
Is it possible change the behaviour of the chunker?
With a plural noun in a singular noun phrase, the chunker incorrectly gives the last word the chunk
E-NP-plural
. Some examples:The aircraft maintenance manager is in the hangar.
Does your box converter operate correctly?
The happy sheep is grazing in the field.
The shiny new aircraft was on the runway.
I’d like a fish pie and a kilo of sprouts.
(Only a very small proportion of plural noun phrases end with a singular noun. Example, ‘sergeants major’: https://www.merriam-webster.com/dictionary/sergeants%20major).
The text was updated successfully, but these errors were encountered: