Removed DaskBackend and moved tests to RayBackend #1412

Merged: 7 commits merged on Oct 24, 2021

tgaddair
Collaborator

The DaskBackend only works because Petastorm and TFRecord serve as intermediate storage formats. However, both are slated for removal in favor of RayDatasets, which has outperformed them in our benchmarks.

Additionally, the DaskBackend did not support distributed training (only distributed preprocessing), which made its utility rather limited. I think it would be worth revisiting this backend in the future if users bring a compelling use case, but for now it's better to support only what is well tested and proven effective.

@tgaddair merged commit 95a7ce3 into master on Oct 24, 2021
@tgaddair deleted the dask-to-ray branch on October 24, 2021 05:13