You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ARROW-13344 enabled the dplyr verb summarise() to use the Arrow engine but kept this off by default, controlled by the arrow.debug option.
Before this can be turned on by default, we should ensure that the following are all implemented:
a sufficient set of hash aggregate kernels and R aggregate function mappings to them, covering the vast majority of all aggregate functions that dplyr users call in summarise() (add any additional required ones to ARROW-13339)
support for a sufficient set of data types in aggregates
support for a sufficient set of data types in grouping columns
handling of NA and NaN values in aggregates and the na.rm option consistent with base R and dplyr (ARROW-13497 and possibly other issues)
handling of NA and NaN values in grouping columns consistent with dplyr
handling empty or bad input to summarise() (ARROW-13543)
many new tests to confirm equivalent results from a variety of group_by() %>% summarise() queries on data frames and on Arrow data
ARROW-13344 enabled the dplyr verb
summarise()
to use the Arrow engine but kept this off by default, controlled by thearrow.debug
option.Before this can be turned on by default, we should ensure that the following are all implemented:
summarise()
(add any additional required ones to ARROW-13339)NA
andNaN
values in aggregates and thena.rm
option consistent with base R and dplyr (ARROW-13497 and possibly other issues)NA
andNaN
values in grouping columns consistent with dplyrsummarise()
(ARROW-13543)group_by() %>% summarise()
queries on data frames and on Arrow dataReporter: Ian Cook / @ianmcook
Assignee: Ian Cook / @ianmcook
Related issues:
Note: This issue was originally created as ARROW-13618. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: