Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

STOP_NER stops Locations from being found with rules #738

Open
maxaalexeeva opened this issue Jan 26, 2020 · 1 comment
Open

STOP_NER stops Locations from being found with rules #738

maxaalexeeva opened this issue Jan 26, 2020 · 1 comment
Assignees

Comments

@maxaalexeeva
Copy link
Contributor

While we were working on the migration branch, @zupon wrote a few backoff rules to get the locations that are not found by the neural net:

It looks like (at least) this is stopping these rules from working:

// Make sure there is a noun that isn't a named entity. We can also check for stop words with some re-architecting...

!StopwordManager.STOP_NER.contains(entities(i))

because 'LOCATION' is included in the list of the NER stop labels:

val STOP_NER: Set[String] = Set("DATE", "DURATION", "LOCATION", "MONEY", "NUMBER", "ORDINAL", "ORGANIZATION", "PERSON", "PLACE", "SET", "TIME")

What would be the way to go about this? Taking locations out of the stop word list? Making a more fine-grained check for what entities to filter out and which ones to keep? Let it be and just keep missing some of the locations? Some other option?

Here's an example:

a) with entities filtered out:
Screenshot from 2020-01-26 13-01-08

b) with entities not filtered out (the locations are found with the location-backoff rule):
Screenshot from 2020-01-26 13-00-40

@BeckySharp
Copy link
Contributor

BeckySharp commented Jan 27, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants