You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To be clear, the bug seems to be triggered under these conditions since 0.20.22:
Sort by multiple columns with nulls_last=True AND
One or more columns are lexical ordering Categorical with no categories and all-null values.
It works fine with 0.20.21.
Note that in the example the df2["b"] doesn't have to be a lexical Categorical, it could very well be string type and sort on df4 will still segfault. I make df2["b"] a Categorical to show that having an empty category list seems to be essential to trigger the problem. The cast seems to recompute the categories even when the type is not actually changing.
Checks
Reproducible example
Log output
Issue description
To be clear, the bug seems to be triggered under these conditions since 0.20.22:
nulls_last=True
ANDIt works fine with 0.20.21.
Note that in the example the
df2["b"]
doesn't have to be a lexical Categorical, it could very well be string type and sort ondf4
will still segfault. I makedf2["b"]
a Categorical to show that having an empty category list seems to be essential to trigger the problem. The cast seems to recompute the categories even when the type is not actually changing.Expected behavior
Sort the dataset without crashing.
Installed versions
The text was updated successfully, but these errors were encountered: