New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Differentiate between binding and advertising address for dask workers #1258
Comments
|
Unless I misunderstand, I have another similar but different use case: My scheduler is running in the cloud, but my worker is behind a router/nat. I had started off thinking that port-forwarding would be enough, but seem to have this same issue. If I give the worker the local IP (or let it discover it), the scheduler will try and fail to connect to the 192.168.x.x. address. If I give
I guess a work-around would be to do something with SSH tunnelling, but that seems messy. |
|
Yes this is very similar. I believe while ssh tunnelling could be potentially used (I have not tried this yet) to bring all containers to the same network it is not just messy but may also be very costly in case of data shuffle across workers/scheduler. Correct me if I am wrong. |
|
@sohaibiftikhar you might look for |
|
@mrocklin doing a simple test where I just hard-coded the |
|
Note that register is called in two locations, first when it registers and then for every heartbeat ( a few seconds later). Heartbeat messages are just idempotent registration messages. |
|
@mrocklin So I looked at the code and I have a fix ready that I still need to test but before that I have a doubt regarding the worker address binding. If we start the worker with --UPDATE-- |
Currently the bind and the advertise ip of dask-workers is the same. In case the dask-worker is running inside a docker container in bridge mode these addresses will not be the same. While the worker binds to some internal address the scheduler needs to communicate to it using some external IP.
Currently we only have the option of settings the host address or the interface on which the Dask-worker listens on but nothing about which address is advertised to the scheduler.
The text was updated successfully, but these errors were encountered: