How to extract entities from text using Wikipedia2Vec #29

Closed
iknoorjobs opened this issue Mar 15, 2019 · 3 comments

Comments

@iknoorjobs

iknoorjobs commented Mar 15, 2019

Is there any way to see the vocabulary of the pretrained model, as you can with Word2Vec?

@ikuyamada
Contributor

ikuyamada commented Mar 15, 2019

You can obtain the list of entities by calling the entities() method of the dictionary instance:

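from wikipedia2vec import Wikipedia2Vec

# MODEL_FILE is the path to a pretrained Wikipedia2Vec model file
model = Wikipedia2Vec.load(MODEL_FILE)
# entities() yields Entity objects; list() materializes them
list(model.dictionary.entities())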
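
Since the question asks about the vocabulary as a whole, here is a minimal sketch that collects both words and entities as plain strings. The helper name `vocabulary` is hypothetical, and the sketch assumes the dictionary also exposes a `words()` iterator and that its items carry `.text` (words) and `.title` (entities) attributes.

```python
from typing import List, Tuple


def vocabulary(dictionary) -> Tuple[List[str], List[str]]:
    """Return (word strings, entity titles) from a Wikipedia2Vec-style dictionary.

    Assumption: `dictionary` has words() and entities() iterators whose items
    expose `.text` and `.title` respectively.
    """
    words = [w.text for w in dictionary.words()]
    titles = [e.title for e in dictionary.entities()]
    return words, titles
```

With a loaded model this would be called as `words, titles = vocabulary(model.dictionary)`. On a full pretrained model both lists can be large, so iterating lazily may be preferable to building lists.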

@iknoorjobs
Author

iknoorjobs commented Mar 15, 2019

Okay, thanks a lot.
And is there any way to extract entities from a paragraph of text?

@ikuyamada
Contributor

Wikipedia2Vec does not currently provide a model to extract entities. You need to build a model based on the embeddings.
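
As a rough starting point, one simple baseline is plain dictionary lookup: match token n-grams in the text against known entity titles, preferring the longest match. This is not an embedding-based model and not something Wikipedia2Vec provides; the function name `extract_entities` and its parameters are hypothetical, and the matching is purely lexical with no disambiguation.

```python
import re
from typing import Iterable, List, Tuple


def extract_entities(
    text: str, titles: Iterable[str], max_len: int = 5
) -> List[Tuple[str, int, int]]:
    """Greedy longest-match of token n-grams against known entity titles.

    Returns (title, start_char, end_char) tuples. Matching is case-insensitive.
    """
    # Map lowercased titles back to their original form
    lookup = {t.lower(): t for t in titles}
    # Word tokens with their character offsets in the text
    tokens = [(m.start(), m.end()) for m in re.finditer(r"\w+", text)]
    found = []
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest span first, then shrink
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            start, end = tokens[i][0], tokens[i + n - 1][1]
            candidate = text[start:end].lower()
            if candidate in lookup:
                found.append((lookup[candidate], start, end))
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return found
```

The titles could come from `[e.title for e in model.dictionary.entities()]`, assuming entity objects expose a `title` attribute. A lookup like this tends to be noisy, because common words often coincide with entity titles, so the embeddings would still be needed to rank or filter candidates against their context.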
