-
-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Information Extraction (Knowledge Triples) #3303
Comments
If you haven't seen it yet, you might find these examples useful: https://github.com/explosion/spaCy/tree/master/examples/information_extraction Especially the entity relations script shows a very similar use case: extracting the relationships between phrases and named entity types, using the dependency parse. For the new v2.1 docs, I also added a section on combining models with rules for information extraction. It's not live yet, but you can already read the draft here: https://github.com/explosion/spaCy/blob/develop/website/docs/usage/rule-based-matching.md#combining-models-and-rules-models-rules |
Thanks for pointing me in the right direction, @ines I have the following code, but it doesn't seem very robust:
which produces:
For example, it doesn't capture ideas like Obama being president of the United States. Do you have any recommendations to make this more robust? |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Feature description
I've seen scattered posts and issues about information extraction using spaCy, but no concrete solution.
Ideally, we'd have the following:
Example:
Input: "Barrack Obama was born in Hawaii. He was president of the United States and lived in the White House"
Output:
Is this something that can be easily done at the moment?
Could the feature be a custom component or spaCy plugin?
If so, we will tag it as
project idea
so other users can take it on.The text was updated successfully, but these errors were encountered: