You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Anything else we need to know?:
Probably due to the p2p shuffling. dask_df = dask_df.shuffle(on="b", shuffle="tasks").persist()
does not have the issue.
If the column is not categorical it also does not happen.
Describe the issue:
If you P2P shuffle while your dataframe has a categorical column that column will be turned into all NaNs.
Minimal Complete Verifiable Example:
Anything else we need to know?:
Probably due to the p2p shuffling.
dask_df = dask_df.shuffle(on="b", shuffle="tasks").persist()
does not have the issue.
If the column is not categorical it also does not happen.
Might be related to
#8183 and #8165
Environment:
Dask version: 2023.9.1
Python version: 3.10
Operating System: Linux
Install method (conda, pip, source): conda
The text was updated successfully, but these errors were encountered: