-
-
Notifications
You must be signed in to change notification settings - Fork 256
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ColumnTransformer: 'DataFrame' object has no attribute 'take' with sklearn >= 1.0.0 #887
Comments
Thanks for the report @zexuan-zhou. Are you able to debug it further? Most likely scikit-learn previously cast a (dask) DataFrame to an ndarray, but no longer does that. We were apparently relying on that behavior for this example. That said, scikit-learn not casting is probably a good thing. So we might want to update accordingly. |
I'm not 100% sure but I'm like 90% sure that's the case because I have to update my test cases to differentiate pd object and np object now with the new version of sklearn. |
cc @VibhuJawa (in case this is of interest) |
Anyone working on this or any update? |
Not aware of anyone working on this. If this is of interest, feel free to pick it up :) |
@jakirkham , Unfortunately , I don't have that deep knowledge of dask internals. |
@TomAugspurger please see the below reproducible example. I went through other github issues regarding dask and column transformer and found some comments suggesting that we downgrade to sklearn 0.24, dask_ml==1.9.0, and dask=='2021-12-03'. Below is a reproducible example of what passes / fails regarding the dask ColumnTransformer.
When I upgrade dask, dask_ml, and sklearn to 1.1.1 I get the same errors above: |
Thanks for the reproducible example. We'll need someone to step through and figure out exactly what changed in scikit-learn / pandas and adapt. I won't have time to work on this anytime soon. |
Any update on that? I got the same issue with dask columnTrasformer using This is the code I used:
Version |
What happened:
from dask_ml.compose import ColumnTransformer
doesn't supportsklearn >= 1.0.0
even though it says it supports itdask-ml/setup.py
Line 19 in cf24100
What you expected to happen:
use
scikit-learn==0.24.0
with the following code worksMinimal Complete Verifiable Example:
Environment:
The text was updated successfully, but these errors were encountered: