Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add deployment update post #58

Merged
merged 5 commits into from
Nov 1, 2019
Merged

Add deployment update post #58

merged 5 commits into from
Nov 1, 2019

Conversation

mrocklin
Copy link
Member

This includes a summary and links to the various deployment efforts that
have occurred in the last few months.

So far this is pretty rough. Any help, including direct edits, would be welcome.

cc @lesteve @guillaumeeb @jhamman @andersy005 @jacobtomlinson @jcrist @pentschev

This includes a summary and links to the various deployment efforts that
have occurred in the last few months.
Dask-Cloudprovider and Dask-CUDA libraries place them
all under the same `dask.distributed.SpecCluster` superclass. So we can expect a high degree of
uniformity from them. Additionally, all of the classes now inherit from the
`dask.distributed.Cluster` class, which standardizes things like adaptivity,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note: dask-gateway and dask-yarn don't inherit from dask.distributed.Cluster, but do match the API.

Copy link
Member

@jacobtomlinson jacobtomlinson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Look great to me! Thanks for putting this together.

For cloud deployments we generally recommend using a hosted Kubernetes or Yarn
service, and then using Dask-Kubernetes or Dask-Yarn on top of these.

However in some institutions these hosted services aren't yet accessible, and
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would maybe reword this as something like "in some institutions they have made decisions or commitments to use certain vendor specific technologies."

I think a lot of the use cases for dask-cloudprovider are going to be folks who have gone "all in" on a certain cloud provider or set of technologies. I've seen a few organisations to this, sometimes for a reduced price, sometimes to encourage a constrained pallete of tools, etc.

_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
found in HPC centers and Dask-Kubernetes. These now share a common codebase
along with Dask SSH, and so are much more consistent and hopefully bug free.

Hopefully users shouldn't notice much difference with existing workloads,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You've got two hopefullys four words apart here. May want to swap one for something else.


In some cases users may not have access to the cluster manager. For example
the institution may not give all of their data science users access to the Yarn
or Kubernetes cluster. In this the [Dask-Gateway](https://gateway.dask.org)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
or Kubernetes cluster. In this the [Dask-Gateway](https://gateway.dask.org)
or Kubernetes cluster. In this case the [Dask-Gateway](https://gateway.dask.org)

1. One Dask-worker per GPU on a machine
2. Specify the `CUDA_VISIBLE_DEVICES` environment variable to pin that worker
to that GPU
3. If your machine has multiple network interfaces then choose the network interface closest to that GPU
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we should define what "closest" means in this context, which is relative to system's topology. Ideally, we would provide samples on how can users find that information, but I think this may be too out of context for this blog.

Copy link
Member

@guillaumeeb guillaumeeb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks fo this @mrocklin, sorry I did not notice it among all other notifications 🙂.
Did a few suggestions, take or leave, this is already very nice!

_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
_posts/2019-10-16-deployment-updates.md Outdated Show resolved Hide resolved
the institution may not give all of their data science users access to the Yarn
or Kubernetes cluster. In this case the [Dask-Gateway](https://gateway.dask.org)
project may be useful.
It can launch and manage Dask jobs,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dask jobs or Dask clusters?

mrocklin and others added 2 commits November 1, 2019 07:46
Co-Authored-By: Guillaume Eynard-Bontemps <g.eynard.bontemps@gmail.com>
@mrocklin mrocklin merged commit 55893df into dask:gh-pages Nov 1, 2019
@mrocklin
Copy link
Member Author

mrocklin commented Nov 1, 2019

Thanks all. This is in.

@mrocklin mrocklin deleted the deployment branch November 1, 2019 14:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants