Feature Request: Support Nested Named Entity Recognition #544

Closed
alokpadhi opened this issue Jan 20, 2020 · 10 comments
Labels
feature request feature request for doccano
Projects

Comments

@alokpadhi

alokpadhi commented Jan 20, 2020

Feature description

Currently, NER tagging only allows labeling a single entity at a time. Is it possible to annotate an entity inside another entity (a nested entity)?
For example, a name entity could be placed inside a personal info entity.
My name is xxx and I live in yyy. Here the whole sentence is personal info, but xxx is a name entity.
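
To make the request concrete, here is a minimal sketch of how the example could be represented as two character-offset spans, one inside the other. The `Span` class, the `contains` helper, and the label names are assumptions for illustration only, not doccano's actual data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str  # hypothetical label name


text = "My name is xxx and I live in yyy."

# The outer entity covers the whole sentence.
outer = Span(0, len(text), "personal_info")

# The inner entity covers only the name.
name_start = text.index("xxx")
inner = Span(name_start, name_start + len("xxx"), "name")


def contains(a: Span, b: Span) -> bool:
    """Return True if span b lies entirely within span a."""
    return a.start <= b.start and b.end <= a.end


print(text[inner.start:inner.end])
print(contains(outer, inner))
```

A tool that supports nested NER has to accept both spans at once, even though `inner` falls entirely within `outer`.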

@icoxfog417 icoxfog417 changed the title from "Entity inside Entity" to "Feature Request: Support Nested Named Entity Recognition" Jan 20, 2020
@icoxfog417 icoxfog417 added the "feature request" (feature request for doccano) label Jan 20, 2020
@icoxfog417
Contributor

There are several variations of entity overlap. We have to handle these cases carefully when implementing the annotation feature.

Reference: Recognizing Complex Entity Mentions: A Review and Future Directions
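
As a rough illustration of why the cases need separate handling, the sketch below classifies how two contiguous spans relate: disjoint, identical, one nested in the other, or crossing (partial overlap). The function name and the return strings are assumptions for this example. It models spans as half-open `(start, end)` offsets and does not cover discontinuous entities.

```python
def relation(a, b):
    """Classify how two half-open (start, end) spans relate to each other."""
    (a_start, a_end), (b_start, b_end) = a, b
    # No shared characters at all.
    if a_end <= b_start or b_end <= a_start:
        return "disjoint"
    # Exactly the same range (e.g. two labels on the same text).
    if (a_start, a_end) == (b_start, b_end):
        return "identical"
    # One span fully inside the other: the nested case.
    if a_start <= b_start and b_end <= a_end:
        return "a_contains_b"
    if b_start <= a_start and a_end <= b_end:
        return "b_contains_a"
    # Shared characters, but neither contains the other.
    return "crossing"


print(relation((0, 33), (11, 14)))
print(relation((0, 5), (3, 8)))
```

Nested spans can be drawn as layers, while crossing spans cannot be expressed as a simple tree, which is one reason the two cases may need different UI treatment.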

@Hironsan
Member

Hello @alokpadhi ,

Do you know any annotation tool which can annotate nested NER?
If you know it, please tell me for my reference.

@alokpadhi
Author

> Hello @alokpadhi ,
>
> Do you know any annotation tool which can annotate nested NER?
> If you know it, please tell me for my reference.

For that, you can use Inception, Dataturks, or brat.

I found Inception and Dataturks user-friendly, but I would be very happy if doccano got this feature as soon as possible.

@scheiblr
Contributor

scheiblr commented Mar 5, 2020

> Hello @alokpadhi ,
>
> Do you know any annotation tool which can annotate nested NER?
> If you know it, please tell me for my reference.

Anafora could be interesting, too.

@alokpadhi
Author

> > Hello @alokpadhi ,
> > Do you know any annotation tool which can annotate nested NER?
> > If you know it, please tell me for my reference.
>
> Anafora could be interesting, too.

I see, it's quite an interesting tool, and it looks like a more advanced tool for labeling schemas than the ones I mentioned earlier.

@Hironsan Hironsan added this to "To do" in v1.5.0 Mar 31, 2020
@ironsilk

ironsilk commented Jun 6, 2020

I needed this feature myself, so I made an implementation that works well enough for my use. Unfortunately, I had to remove the update functionality because the outer NER's v-menu would always overlap the inner NER's v-menu. Right now, the outer NER can only be deleted after you have deleted all its inner annotations. It's possible to recursively delete its children, but that feature is not implemented.

This version only accepts one level of nesting (one outer annotation containing one or more inner annotations, as in the example image).
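
For reference, the unimplemented "delete the outer annotation together with its children" behaviour could look roughly like the sketch below. It is written in Python rather than in the Vue components for brevity, and the dictionary keys (`id`, `start`, `end`) and the function name are assumptions, not the fields used in the attached files. Because containment by offsets is transitive, a single filter removes children at any depth, so no explicit recursion is needed.

```python
def delete_with_children(annotations, target_id):
    """Return the annotations that remain after deleting the target
    and every annotation whose range lies inside the target's range."""
    target = next(a for a in annotations if a["id"] == target_id)
    return [
        a for a in annotations
        # Keep only annotations that are NOT contained in the target.
        # The target contains itself, so it is removed as well.
        if not (target["start"] <= a["start"] and a["end"] <= target["end"])
    ]


annotations = [
    {"id": 1, "start": 0, "end": 33},   # outer annotation
    {"id": 2, "start": 11, "end": 14},  # inner annotation
    {"id": 3, "start": 29, "end": 32},  # inner annotation
    {"id": 4, "start": 40, "end": 45},  # unrelated annotation
]

remaining = delete_with_children(annotations, 1)
print([a["id"] for a in remaining])
```

Deleting an inner annotation with the same function leaves the outer one untouched, since the outer range is not contained in the inner range.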

I've also added functionality to "round" a selection to whole words.
I've included a zip containing the following modified files:
components/organisms/annotation/EntityItemBox.vue
and
components/molecules/EntityItem.vue
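
The whole-word rounding mentioned above can be sketched as follows. This is not the code from the zip; it is a minimal Python illustration that assumes words are delimited by whitespace and that the selection starts and ends inside a word. The function name is hypothetical.

```python
def snap_to_words(text, start, end):
    """Expand the half-open range [start, end) outward until both
    edges sit on whitespace or on the text boundaries."""
    # Move the left edge back to the beginning of the word.
    while start > 0 and not text[start - 1].isspace():
        start -= 1
    # Move the right edge forward to the end of the word.
    while end < len(text) and not text[end].isspace():
        end += 1
    return start, end


text = "My name is xxx and I live in yyy."
start, end = snap_to_words(text, 12, 13)  # user selected only the middle "x"
print(text[start:end])
```

Since only whitespace counts as a boundary here, trailing punctuation is pulled into the selection (selecting part of `yyy` yields `yyy.`); a real implementation would probably want a stricter definition of a word.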

EntityItem.zip
Screenshot from 2020-06-06 12-17-05

PS: the text is random, source: https://github.com/minimaxir/textgenrnn/blob/master/datasets/reddit_relationshipadvice_legaladvice_2000.txt

@Hironsan
Member

Hironsan commented Jul 25, 2020

Rough design idea:

image

  • From the perspective of the backend, it's relatively easy to support nested named entities.
  • From the perspective of the frontend, I need to investigate which UI/UX is best, how to implement it, and how to integrate it into the existing one.
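
Whatever UI is chosen, one ingredient the frontend would probably need is the nesting depth of each span, so that nested highlights can be stacked instead of drawn on top of each other. The sketch below is an assumption about how that could be computed, not part of the design above; it treats spans as half-open `(start, end)` tuples and uses a hypothetical function name.

```python
def nesting_depths(spans):
    """Map each (start, end) span to the number of other spans containing it."""
    depths = {}
    for span in spans:
        depths[span] = sum(
            1
            for other in spans
            # Count every different span whose range covers this one.
            if other != span and other[0] <= span[0] and span[1] <= other[1]
        )
    return depths


spans = [(0, 33), (11, 14), (29, 32), (11, 13)]
for span, depth in nesting_depths(spans).items():
    print(span, depth)
```

A renderer could then, for example, offset each underline vertically by its depth. The quadratic loop is fine for the handful of spans in a single document but would need rethinking for very dense annotations.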

@Hironsan
Member

Coming soon...

@akinolawilson

Is there an alternative approach anyone would suggest in the meantime?

I am working on a named entity recognition and linking pipeline. I would like to be able to specify the unique identifier inside of the labels provided by Doccano.

@PeterPanUnderhill

> Coming soon...

Is there an estimated time?

Projects
No open projects
v1.5.0
  
Done
Development

No branches or pull requests

7 participants