Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

az extension add -n azure-cli-ml results in 500 #15434

Closed
lostmygithubaccount opened this issue Oct 7, 2020 · 10 comments
Closed

az extension add -n azure-cli-ml results in 500 #15434

lostmygithubaccount opened this issue Oct 7, 2020 · 10 comments
Assignees

Comments

@lostmygithubaccount
Copy link

az feedback auto-generates most of the information requested below, as of CLI version 2.0.62

Describe the bug
Running az extension add -n azure-cli-ml frequently results in 500 error.

To Reproduce
Run the command in a GitHub action fairly frequently :)

Expected behavior
azure ml cli extension is installed without hitting 500

Environment summary
github action on standard ubuntu agents

Additional context
image

@ghost ghost added the needs-triage This is a new issue that needs to be triaged to the appropriate team. label Oct 7, 2020
@yonzhan
Copy link
Collaborator

yonzhan commented Oct 7, 2020

azure-cli-ml

@yonzhan yonzhan added the Machine Learning az ml label Oct 7, 2020
@ghost ghost removed the needs-triage This is a new issue that needs to be triaged to the appropriate team. label Oct 7, 2020
@yungezz
Copy link
Member

yungezz commented Oct 8, 2020

hi @fengzhou-msft could you pls help to check availability of the storage url?

@v-strudm-msft v-strudm-msft added the ML-MLOps AreaPath label Oct 12, 2020
@lostmygithubaccount
Copy link
Author

@fengzhou-msft @yungezz any updates here? this is increasingly becoming a point of failure in our automated testing for Azure Machine Learning

@fengzhou-msft
Copy link
Member

We have also seen increasing failures for this aka.ms link. We will work with the Redirection Service team to investigate it.

As a workaround, you can get the package URL from here and install with
az extension add -s https://azurecliext.blob.core.windows.net/release/azure_cli_ml-1.15.0-py3-none-any.whl -y to bypass the index link.

@lostmygithubaccount
Copy link
Author

can running az extension add -n azure-cli-ml be changed to that link instead?

@fengzhou-msft
Copy link
Member

can running az extension add -n azure-cli-ml be changed to that link instead?

We can't change it directly. The link is an index of all extensions, CLI looks for the download link for azure-cli-ml extension or any other extensions in it.

We will see how we can improve it once we identify the root cause of the issue.

@lostmygithubaccount
Copy link
Author

@fengzhou-msft any updates?

@fengzhou-msft
Copy link
Member

fengzhou-msft commented Nov 6, 2020

We encountered other connection failures quite frequently recently while downloading different extensions using storage account links directly in Azure Deveops. It looks more like network issues with the CI environment.

@fengzhou-msft
Copy link
Member

We contacted CSS team and got the following response:

As per the issue of random 500 errors for your storage account, our product group has helped to mitigate the issue on the backend.

Regarding the root cause of the issue, one of the backend nodes got an unusual spike of incoming requests, causing higher latencies on the node. As for now, on the Azure Storage side, we are working towards making improvements to detect these spikes sooner and remove such problematic nodes out of service sooner.

cc: @jiasli

@fengzhou-msft
Copy link
Member

We've added retry for fetching the extension index file. This issue should be mitigated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants