Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Long wait time to get dask workers #953

Open
jbusecke opened this issue Apr 30, 2021 · 3 comments
Open

Long wait time to get dask workers #953

jbusecke opened this issue Apr 30, 2021 · 3 comments

Comments

@jbusecke
Copy link

I have been noticing very long wait times to get dask workers to come online lately.

It just took me ~30 min to get any workers on the pangeo google cloud deployment.

Is there a way to resolve this? @rabernat suggested that "the cluster is maxed out".

For completeness, this is what I do in my notebook (pretty much the recommmended code):

from dask_gateway import GatewayCluster

cluster = GatewayCluster()
# cluster.adapt(minimum=4, maximum=40)  # or  to a fixed size.
cluster.scale(10)
cluster
@rabernat
Copy link
Member

Apparently we have a 100 vCPU limit on the cluster, and today we were at that limit.

I just bumped it to 200. (For those with access, the page is here: https://console.cloud.google.com/kubernetes/clusters/details/us-central1-b/pangeo-uscentral1b/details?project=pangeo-181919)

Did that resolve the issue?

@jbusecke
Copy link
Author

I was eventually able to get workers even before raising this issue, but Ill keep an eye out in the upcoming days.

@jbusecke
Copy link
Author

Quick update: Right now I am getting dask workers quickly! Thanks for the adjustment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants