Issues with SLP model and POS tagging (pattern.en) #182
There are currently some issues with POS misclassification in
    # Assert [("black", "JJ"), ("cats", "NNS")].
    v = en.tag("black cats")
    self.assertEqual(v, [("black", "JJ"), ("cats", "NNS")])
The test fails because 'black' gets classified as
Here is what happens: When we call
Sure, the SLP model is a statistical model and is therefore allowed to be wrong in some cases, but what bothers me is that this apparently used to work some time ago. Sentences of the form "The black cat sat on..." are scattered so widely across the unit tests that I can't believe the model got them wrong all along.
I just can't find the cause for this change. @tom-de-smedt, what am I missing?
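To help narrow this down, here is a minimal sketch of a helper that runs a tagger over expected (word, tag) pairs and reports every token whose tag differs. The names `find_tag_mismatches`, `CASES`, and `stub_tagger` are hypothetical and not part of pattern. The stub returns a placeholder tag `"??"` for unknown words purely so the sketch runs without pattern installed; it does not reflect what the SLP model actually returns.

```python
def find_tag_mismatches(tagger, cases):
    """Return (sentence, word, expected_tag, actual_tag) for each mismatched token.

    `tagger` is any callable mapping a sentence to a list of (word, tag) tuples.
    """
    mismatches = []
    for sentence, expected in cases:
        actual = tagger(sentence)
        if len(actual) != len(expected):
            # Tokenization differs, so report the whole sentence at once.
            mismatches.append((sentence, None, expected, actual))
            continue
        for (word, want), (_, got) in zip(expected, actual):
            if want != got:
                mismatches.append((sentence, word, want, got))
    return mismatches


# The one expectation taken from the failing unit test above.
CASES = [
    ("black cats", [("black", "JJ"), ("cats", "NNS")]),
]


def stub_tagger(sentence):
    """Hypothetical stand-in tagger: knows only 'cats', tags the rest '??'."""
    known = {"cats": "NNS"}
    return [(word, known.get(word, "??")) for word in sentence.split()]


for sentence, word, want, got in find_tag_mismatches(stub_tagger, CASES):
    print("%r in %r: expected %s, got %s" % (word, sentence, want, got))
```

With pattern installed, passing the real tagger instead of the stub (for example `from pattern.en import tag`, then `find_tag_mismatches(tag, CASES)`) should show whether the misclassification is limited to this sentence or also affects other sentences added to `CASES`.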