
Missing ImageNet Sample Dataset - ERROR 404: The specified blob does not exist #111593

Closed
mattmazzola opened this issue Jun 29, 2023 · 3 comments

Comments

@mattmazzola
Member

mattmazzola commented Jun 29, 2023

Issue

On step: https://learn.microsoft.com/en-us/azure/machine-learning/how-to-image-processing-batch?view=azureml-api-2&tabs=cli#registering-the-model

The tutorial asks you to download a sample dataset to test the image classifier:

wget https://azuremlexampledata.blob.core.windows.net/data/imagenet-1000.zip

Error

mattmazzola@<removed>:/mnt/d/repos/azureml-examples/cli/endpoints/batch/deploy-models/imagenet-classifier$ wget https://azuremlexampledata.blob.core.windows.net/data/imagenet-1000.zip
--2023-06-29 10:40:43--  https://azuremlexampledata.blob.core.windows.net/data/imagenet-1000.zip
Resolving azuremlexampledata.blob.core.windows.net (azuremlexampledata.blob.core.windows.net)... 20.209.0.229
Connecting to azuremlexampledata.blob.core.windows.net (azuremlexampledata.blob.core.windows.net)|20.209.0.229|:443... connected.
HTTP request sent, awaiting response... 404 The specified blob does not exist.
2023-06-29 10:40:44 ERROR 404: The specified blob does not exist..

It seems directory listing isn't enabled for the storage container, so it's not clear what the correct file path should be.

However, the model blob downloaded earlier in the tutorial DID exist, so the container must be there:

https://azuremlexampledata.blob.core.windows.net/data/imagenet/model.zip

Solution

I manually added the missing imagenet segment to the URL and it worked!

| Label | URL |
| --- | --- |
| Incorrect | wget https://azuremlexampledata.blob.core.windows.net/data/imagenet-1000.zip |
| Correct | wget https://azuremlexampledata.blob.core.windows.net/data/imagenet/imagenet-1000.zip |

This could be a more significant issue, since there isn't an obvious fix and not having the data would block people from completing the tutorial.

Suggestion

Perhaps a PR validation action could detect all blob URLs in the markdown and request each one to verify that the destination exists, rejecting the PR if any is missing. The issue could still occur if someone manually deleted a blob after the fact, but it would be caught on the next PR validation run.
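
A rough sketch of what such a check might look like, in Python. Everything here is an assumption for illustration, not an existing action: the function names (`find_blob_urls`, `head_status`, `find_broken_urls`, `main`) are hypothetical, the regex simply assumes blob URLs use the `*.blob.core.windows.net` host seen above, and it assumes the blobs are publicly readable so that an anonymous HEAD request returns a meaningful status.

```python
import re
import urllib.error
import urllib.request
from typing import Callable, Iterable

# Assumption: blob URLs look like https://<account>.blob.core.windows.net/<path>
BLOB_URL_RE = re.compile(r"https://[a-z0-9]+\.blob\.core\.windows\.net/[^\s()<>\[\]]+")


def find_blob_urls(markdown: str) -> list[str]:
    """Return the unique blob URLs in the text, in order of first appearance."""
    seen: dict[str, None] = {}
    for match in BLOB_URL_RE.finditer(markdown):
        # Drop punctuation or quotes that trail the URL in prose.
        url = match.group(0).rstrip(".,;:'\"")
        seen.setdefault(url, None)
    return list(seen)


def head_status(url: str, timeout: float = 10.0) -> int:
    """Send a HEAD request and return the HTTP status code.

    Network-level failures (DNS, timeout) are not handled and will raise.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # e.g. 404 when the blob does not exist


def find_broken_urls(
    urls: Iterable[str],
    get_status: Callable[[str], int] = head_status,
) -> list[tuple[str, int]]:
    """Return (url, status) for every URL whose status is 400 or higher."""
    broken = []
    for url in urls:
        status = get_status(url)
        if status >= 400:
            broken.append((url, status))
    return broken


def main(paths: list[str]) -> int:
    """Check every blob URL in the given markdown files; return an exit code."""
    text = ""
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            text += handle.read() + "\n"
    broken = find_broken_urls(find_blob_urls(text))
    for url, status in broken:
        print(f"{status} {url}")
    return 1 if broken else 0
```

A validation step could call `main` with the changed markdown files and fail the run on a non-zero return value. The `get_status` parameter is injectable so the logic can be exercised without network access.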



@santiagxf
Contributor

Thanks for reporting this issue @mattmazzola. Indeed, you were right. The path is wrong in the CLI example, while it was correct in the SDK one. It has been addressed in the following PR: Azure/azureml-examples#2413. Thanks!

@Naveenommi-MSFT
Contributor

@mattmazzola
Thank you for bringing this to our attention. We have created a PR for this issue and the changes should go live soon.

@santiagxf
Thank you for your response.

@Naveenommi-MSFT
Contributor

@mattmazzola
We are going to close this thread as resolved, but if there are any further questions regarding the documentation, please tag me in your reply and we will be happy to continue the conversation.
#Please-close
