-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Question]: Pre-Tagging information in Sequence-Tagging? #3416
Comments
Hi @raykyn I am sorry for late response. |
Hi, thank you for the reponse! I'm not sure what you refer to in that tutorial? Unless you mean the format in which the annotation is printed with the |
Hi @raykyn, another solution you can consider is training two separate models using multitask learning and have the shared embedding do auto feature engineering since the embeddings will create features for tagging both event triggers, actors and objects as well as PER and ORGs. The model will learn something is tagged as PER or ORG will more likely be actor as well. This will probably improve the model performance of your main task without having add the tags. Another benefit to this approach is when you run your model at inference time you don't need to run the another model. |
Thank you very much for this input, I actually completely missed that there is support for multitask learning! |
Question
My specific use case:
I'm trying to solve an event-extraction task which I model as a sequence-tagging problem. So this event-tagger should be able to identify the event trigger, actors and objects in a given span. Now, I've got pretty reliable NER-tags which I would like as a additional information for my event-tagger to use as information (as for example, only PER and ORGs may be actors).
Is there a best practice to use this information? I'm thinking the easy way would be to put annotations inside the train/dev/test data so they can be part of the encoding? Is there a known best way to write these anntotations?
Barack Obama went to New York
[PER] Barack Obama [/PER] went to [LOC] New York [/LOC]
Barack [B-PER] Obama [I-PER] went to New [B-LOC] York [I-LOC]
I guess I'm asking less for a technical solution and more if there is an established way to do this or at least some experience?
(also to the devs: thank you for this awesome framework and the char-based embeddings, pretty much none of my research would be possible without Flair)
The text was updated successfully, but these errors were encountered: