Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NE type consistency in relation to nerd-kid #74

Open
tantikristanti opened this issue Mar 20, 2018 · 4 comments
Open

NE type consistency in relation to nerd-kid #74

tantikristanti opened this issue Mar 20, 2018 · 4 comments
Assignees

Comments

@tantikristanti
Copy link
Collaborator

tantikristanti commented Mar 20, 2018

The same mention can have different types.

One example can be found in the result of a text disambiguation service with the text PubMed_2. It shows that has the same mentions can have different of Type.

screen shot 2018-03-20 at 16 09 25

@kermitt2
Copy link
Owner

Hello this is done on purpose, as we don't know which type is the good one a priori, and the type of the base mention is not overwritten to the acronyms.

Basically I am waiting a mapping of the entities to the types (something like nerd-kid ;) ), and there is nothing particular to the fact it is acronym or not in this situation.

@kermitt2 kermitt2 added enhancement and removed bug labels Mar 20, 2018
@kermitt2
Copy link
Owner

I suggest to either close it or rename/reopen it to broad NE type consistency in relation to nerd-kid.

@tantikristanti tantikristanti changed the title Same acronyms can have different types NE type consistency in relation to nerd-kid Mar 20, 2018
@tantikristanti tantikristanti self-assigned this Jun 1, 2018
@tantikristanti
Copy link
Collaborator Author

Responding to the Named Entity consistency issue #74, a new branch called "0.0.3-nerdKid" has been built for the purpose of developing a mapping of the entitites to the types.

Even though the works are still in progress, but to give some ideas of how it would look like, the results of a dedicated database containing the mapping of entities into the types (e.g. Person, Location, Organisation) have been "inserted" in the demo console of the branch as a field Predicted as.

The results of text disambiguation then can be seen as follows:

  • some entities that are not recognized by entity-fishing became recognizable (the field Predicted as)
  • some entities that are recognized but not consistent, like for example, "TAK1" and "CS" became consistent as "Other"

screen shot 2018-11-21 at 15 20 59

Further works are to ascertain whether the inclusion of the results of the entities <-> class types will give a better impact.

@tantikristanti
Copy link
Collaborator Author

tantikristanti commented Feb 4, 2019

For entity-fishing version 0.0.4, the NE type generated by nerdKid has been integrated in branch 0.0.4-withNerdKidTypeInDemo

screen shot 2019-02-04 at 13 42 25

screen shot 2019-02-04 at 13 42 04

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants