New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] Expose isnull/isnan #976
Comments
@dillon-cullinan You should be able to tackle this as part of #1126 since I think you already generate the boolean mask. |
It would also be good (for downstream usage and pandas API compatibility) to expose this functionality as a top level function. |
Lack of the |
@dillon-cullinan are you able to tackle isnull/isna as part of #1126? @beckernick can you confirm if we need to add special logic for handling strings too? |
In terms of blocking cumulative aggregations, since the cumulative aggregations are are only operating on numeric columns these methods could be implemented at the series level and raise NotImplementedErrors (for now) if the column is a StringColumn. In general, I think whether we need different logic than the numba kernel work in #1126 depends on whether the StringColumn null mask behaves the same way as a cudf::column. If it does, I think we're fine. |
As an update, this specific issue is not necessarily blocking the dask functionality. Either |
Data is often missing and messy. It would be nice to expose
isna
andisnull
methods on series objects. These methods are often used for during filter operations like those below:The text was updated successfully, but these errors were encountered: