Model download returns 0 on failure #1714

mvollrath · 2017-12-11T22:30:57Z

While building a Docker image containing spaCy, there was an issue downloading the basic model with python -m spacy download en:

ReadTimeoutError: HTTPSConnectionPool(host='github-production-release-asset-2e65be.s3.amazonaws.com', port=443): Read timed out.

The download returned 0, so the build continued. Next, while building a Rasa NLU model, it couldn't find the model (as you might expect):

IOError: Can't find model 'en'

So I thought, if the download doesn't report failure, maybe I should use the validate command to make sure there's a model present before continuing the build? But validate also returns 0 when no models are present:

    Installed models (spaCy v2.0.5)
    /usr/local/lib/python2.7/dist-packages/spacy


    No models found in your current environment.

I would expect the download tool to return non-zero when it fails to finish downloading a model.
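
Until the exit codes are fixed, a build step could check for the model itself and fail explicitly. The following is only a minimal sketch, not anything spaCy provides: the helper names `model_installed` and `exit_code_for` are hypothetical, it assumes the model was installed as an importable package (for example `en_core_web_sm`, rather than the `en` shortcut link), and it assumes Python 3.4+ for `importlib.util.find_spec`, which is not available on the Python 2.7 setup reported here.

```python
import importlib.util


def model_installed(package_name):
    """Return True if a top-level package with this name can be found."""
    # find_spec returns None when no top-level package of that name exists.
    return importlib.util.find_spec(package_name) is not None


def exit_code_for(package_name):
    """Map the check to a shell-style exit code: 0 if present, 1 if missing."""
    return 0 if model_installed(package_name) else 1
```

In a Docker build, a small script could end with `sys.exit(exit_code_for("en_core_web_sm"))` right after the download step, so that a missing model stops the build instead of surfacing later as `IOError: Can't find model`.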

Your Environment

Python version: 2.7.12
Platform: Linux-4.4.0-98-generic-x86_64-with-Ubuntu-16.04-xenial
spaCy version: 2.0.5
Environment Information: building an image with Docker 17.09.0

ines · 2017-12-12T00:21:43Z

Thanks – good point, I didn't even realise the download command currently behaves like this – this should definitely be fixed. The validate command was originally intended as more of a user-facing utility that prints nicely formatted and helpful info about the models, so we didn't really consider the automated usage and exit codes here... but we might as well do it properly, so thanks for the suggestion!

Btw, if you're downloading models as part of an automated process, you can also just run pip install directly, and use the URL of the model archive (see the model releases). This lets you download the exact model and model version you need, and saves you the extra roundtrip to the spaCy compatibility table. Instead of calling spacy.load(), you can also import the model as a module:

import en_core_web_sm
nlp = en_core_web_sm.load()

In some cases, it might be a little nicer to get a more "native" ImportError if the model isn't installed, rather than a spaCy error somewhere down the line. (But this also depends on your personal preference. You can find more details on this in the "Using models in production" section in the docs.)
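
A rough sketch of that approach for an automated build might look like the following. The function names `install_model` and `load_model` are made up for illustration, and the archive URL is deliberately left as a parameter: it should be copied from the model releases page rather than guessed. The only assumptions about the model package are the ones shown above, namely that it is importable by name and exposes `load()`.

```python
import importlib
import subprocess
import sys


def install_model(archive_url):
    """Install a model archive with pip for the current interpreter.

    check_call raises subprocess.CalledProcessError if pip exits non-zero,
    so a failed download stops the calling script instead of passing silently.
    """
    subprocess.check_call([sys.executable, "-m", "pip", "install", archive_url])


def load_model(module_name="en_core_web_sm"):
    """Import the model package and call its load().

    Raises a plain ImportError if the package is not installed.
    """
    module = importlib.import_module(module_name)
    return module.load()
```

Because both functions raise on failure, a script that calls `install_model(url)` followed by `load_model()` exits with a non-zero status when either step goes wrong, which is what a Docker build needs in order to stop.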

lock · 2018-05-08T03:55:29Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

ines added the enhancement Feature requests and improvements label Dec 12, 2017

ines mentioned this issue Jan 3, 2018 in 💫 Improve model downloading and linking #1792 (merged, 3 tasks)

ines added a commit that referenced this issue Jan 3, 2018: Exit with 1 if incompatible models found (see #1714), 2c656f9

honnibal closed this as completed in dacfaa2 Jan 11, 2018

lock bot locked as resolved and limited conversation to collaborators May 8, 2018
