This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Can't find model 'en_core_web_sm' #147

Closed
mug2mag opened this issue May 17, 2021 · 3 comments

Comments


mug2mag commented May 17, 2021

Hi
I am trying to run the evaluation on the NQ dataset:

python dense_retriever.py model_file=/home/dhf/git_local/DPR/downloads/checkpoint/retriever/single-adv-hn/nq
qa_dataset=nq_test ctx_datatsets=/home/dhf/git_local/DPR/downloads/data/wikipedia_split
encoded_ctx_files=[\"/home/duhuifang/git_local/DPR/downloads/data/retriever_results/nq/single-adv-hn/wiki_passages_*\"] 
out_file=/home/duhuifang/git_local/DPR/downloads/evaluation

Getting:
OSError: [E050] Can't find model 'en_core_web_sm'. It doesn't seem to be a Python package or a valid path to a data directory.

@vlad-karpukhin
Contributor

This is a spaCy model that our code depends on.
We will need to add it to our setup script somehow.
Running 'python -m spacy download en_core_web_sm' should solve this issue.
Please refer to https://spacy.io/usage for installation instructions.
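
For anyone who wants to guard against this error in a script, the fix above can be sketched in Python. This is only a minimal sketch: the helper names `model_installed` and `ensure_model` are hypothetical and not part of DPR or spaCy. It assumes the model, once downloaded, is installed as an importable Python package with the same name, and it shells out to the same 'python -m spacy download' command.

```python
import importlib.util
import subprocess
import sys

MODEL = "en_core_web_sm"


def model_installed(name: str = MODEL) -> bool:
    """Return True if a top-level package called `name` can be imported."""
    return importlib.util.find_spec(name) is not None


def ensure_model(name: str = MODEL, run=subprocess.check_call) -> bool:
    """Download the model only if it is missing.

    Returns True if a download was triggered, False if the model was
    already present. `run` is injectable so the logic can be exercised
    without actually invoking spaCy.
    """
    if model_installed(name):
        return False
    # Same command as suggested above, run with the current interpreter.
    run([sys.executable, "-m", "spacy", "download", name])
    return True
```

Calling `ensure_model()` once before launching dense_retriever.py should then avoid the E050 error, provided spaCy itself is already installed in the active environment.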

Author

mug2mag commented May 19, 2021

@vlad-karpukhin Much appreciated!

mug2mag closed this as completed May 19, 2021

xhluca commented Aug 2, 2022

@vlad-karpukhin It's possible via install_requires starting in pip>18, as discussed here: https://stackoverflow.com/a/53383704

Let me know if I can start a PR for this.
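
A rough sketch of what the install_requires approach could look like, assuming a PEP 508 direct reference ("name @ URL") to a model archive on the spacy-models GitHub releases page. The helper `spacy_model_requirement` is hypothetical, and the version "3.0.0" is only a placeholder; it would need to match the spaCy version the project pins.

```python
def spacy_model_requirement(name: str, version: str) -> str:
    """Build a PEP 508 direct-reference requirement for a spaCy model.

    The URL pattern is an assumption based on how spacy-models release
    archives are named; verify it against the release you need.
    """
    archive = f"{name}-{version}"
    url = (
        "https://github.com/explosion/spacy-models/releases/download/"
        f"{archive}/{archive}.tar.gz"
    )
    return f"{name} @ {url}"


# Placeholder version: pick the one compatible with your spaCy install.
INSTALL_REQUIRES = [
    "spacy",
    spacy_model_requirement("en_core_web_sm", "3.0.0"),
]
```

The resulting list could be passed as `install_requires=INSTALL_REQUIRES` in a setup.py. Direct URL requirements of this kind work for installs from source, but PyPI does not accept uploaded packages that declare them.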
