Enable training single GPU cuML models using Dask DataFrames and Series #4300
Checks for dask_cudf objects
rerun tests
Just two small comments remaining
@@ -556,8 +563,12 @@ def convert_dtype(X,
        if the conversion would lose information.
    """

    if isinstance(X, (dask_cudf.core.Series, dask_cudf.core.DataFrame)):
        # TODO: Warn, but not when using dask_sql
Can you open a GitHub issue to track this?
Codecov Report
@@ Coverage Diff @@
## branch-21.12 #4300 +/- ##
===============================================
Coverage ? 86.01%
===============================================
Files ? 231
Lines ? 18771
Branches ? 0
===============================================
Hits ? 16146
Misses ? 2625
Partials ? 0
@gpucibot merge
…es (rapidsai#4300)

This PR makes it possible to train single GPU cuML models using Dask DataFrames and Series by converting the Dask data-structures to their cudf counterparts before training. This will allow using Dask-SQL with cuML models.

Tests added for logistic regression, currently working on adding more

Depends on rapidsai#4317

Authors:
- https://github.com/ChrisJar
- Sarah Yurick (https://github.com/sarahyurick)
- Dante Gama Dessavre (https://github.com/dantegd)

Approvers:
- Dante Gama Dessavre (https://github.com/dantegd)

URL: rapidsai#4300
This PR makes it possible to train single-GPU cuML models on Dask DataFrames and Series by converting the Dask data structures to their cudf counterparts before training. This will allow Dask-SQL to be used with cuML models.
Tests have been added for logistic regression; tests for more models are in progress.
Depends on #4317
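For anyone following along, the conversion described above can be sketched roughly as below. This is only an illustration, not the code in this PR: `materialize_if_lazy` and `dask_cudf_types` are hypothetical helper names, and the sketch assumes that calling `.compute()` on a dask_cudf DataFrame or Series returns the corresponding single-GPU cudf object. The lookup falls back to an empty tuple when dask_cudf cannot be imported, so the snippet also runs on a machine without a GPU.

```python
def dask_cudf_types():
    """Return the dask_cudf collection types, or () if dask_cudf is unavailable.

    Hypothetical helper: the broad except is deliberate, since importing
    dask_cudf on a machine without a usable GPU can fail in several ways.
    """
    try:
        import dask_cudf
        return (dask_cudf.Series, dask_cudf.DataFrame)
    except Exception:
        return ()


def materialize_if_lazy(X, lazy_types):
    """Return X.compute() if X is an instance of lazy_types, else X unchanged.

    For dask_cudf inputs this gathers all partitions into one cudf object,
    so the data has to fit on a single GPU.
    """
    if lazy_types and isinstance(X, lazy_types):
        return X.compute()
    return X


class FakeLazy:
    """Stand-in for a Dask collection, used only to exercise the helper."""

    def compute(self):
        return [1, 2, 3]


# Usage with the stand-in type (no GPU required):
result = materialize_if_lazy(FakeLazy(), (FakeLazy,))

# Usage with real Dask inputs would look like:
#   X = materialize_if_lazy(X, dask_cudf_types())
```

Because the whole collection is gathered onto one device, a warning for large inputs (as the TODO in the diff suggests) seems like a reasonable follow-up.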