Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

provide more distributed strategies #135

Open
rom1504 opened this issue Feb 7, 2022 · 2 comments
Open

provide more distributed strategies #135

rom1504 opened this issue Feb 7, 2022 · 2 comments

Comments

@rom1504
Copy link
Owner

rom1504 commented Feb 7, 2022

for example

  • pure ssh
  • dask cluster ?
  • ray cluster ?

follow up of #20

@rom1504
Copy link
Owner Author

rom1504 commented Feb 8, 2022

With new information I gathered, the more important thing here would be to make it as easy as possible to make img2dataset usable in a swarm environnement rather than a cluster: many varied kind of nodes connecting and helping out for a while then disconnecting. This is already kind of working thanks to spark dynamic allocation feature but it could be better tested and better documented / easier to run.
Ideally it would even be possible to do this kind of stuff in a trustless fashion, but this would probably require a lot more engineering than trustful but unreliable

Being able to handle unreliable resources would unlock combining many different resources rather than needing to allocate a lot of resources in a single place

@rom1504 rom1504 pinned this issue Apr 2, 2022
@rom1504
Copy link
Owner Author

rom1504 commented May 16, 2022

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant