untracked: Allow user to specify more aggregation functions, include … #54

oliverbock · 2024-04-15T05:58:02Z

…coundistinct

…ndistinct

oliverbock · 2024-06-19T22:16:00Z

@jrwishart ?

jrwishart · 2024-06-20T02:39:14Z

@jrwishart ?

Apologies. I was away the entire month of April and this slipped through the cracks on my return. Looking now.

jrwishart

Only major thing is to clarify the distinctBy vs distinct_by naming as I fear this might be an unintended bug?
Otherwise minor, non-blocking quibbles.

jrwishart · 2024-06-20T02:34:15Z

R/Factbase.R

+    if (!is.null(distinct_by))
+        metric$distinctBy <- distinct_by


The if check is unnecessary. NULL assignments to lists only occur in the constructing call of a list. When modifying, assigning NULL to a list element is the same as removing that element.

> list(x = 1, y = 2, z = NULL) $x [1] 1 $y [1] 2 $z NULL > l <- list(x = 1, y = 2) > l$z <- NULL > l $x [1] 1 $y [1] 2

Also related is my confusion with distinct_by and distinctBy (there might be a good reason for this but I was confused in a later comment too).

jrwishart · 2024-06-20T02:51:02Z

R/Factbase.R

+        return (list(aggregation = aggregation, distinct_by=NULL))
+    distinct_by <- str_match(aggregation, 'countdistinct\\(([^)]+)\\)')[1, 2]
+    if (is.character(distinct_by)) {
+        if (!(distinct_by %in% names(data)))
+            stop(paste0("Column '", distinct_by, "' is referred to in 'aggregation' but does not exist in 'data'"))
+        return (list(aggregation = 'count', distinctBy = distinct_by))


return list has distinct_by in the first version and distinctBy in the second. I'm guessing one of these needs to be changed for consistency.

Whoops. Thanks.

jrwishart · 2024-06-20T02:51:41Z

R/Factbase.R

+validate_aggregation <- function(aggregation, data) {
+    if (!is.character(aggregation) || length(aggregation) != 1)
+        stop("'aggregation' must be a character vector of length 1")


Is data always intended to be a data.frame? If so, could add a validation check like the aggregation has?

Yes, that is done separately in validate_dataframe() (the data frame is passed in here knowing that it has already been validated)

oliverbock force-pushed the oliver-distinctby branch from 3dbd2ab to 308dcf4 Compare April 15, 2024 06:06

FB-488: Allow user to specify more aggregation functions, include cou…

4d48340

…ndistinct

oliverbock force-pushed the oliver-distinctby branch from 308dcf4 to 4d48340 Compare April 15, 2024 22:20

oliverbock requested a review from jrwishart April 16, 2024 06:26

jrwishart reviewed Jun 20, 2024

View reviewed changes

Code review changes

6e9bc9d

jrwishart approved these changes Jun 20, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

untracked: Allow user to specify more aggregation functions, include … #54

untracked: Allow user to specify more aggregation functions, include … #54

oliverbock commented Apr 15, 2024

oliverbock commented Jun 19, 2024

jrwishart commented Jun 20, 2024

jrwishart left a comment

jrwishart Jun 20, 2024

jrwishart Jun 20, 2024

oliverbock Jun 20, 2024

jrwishart Jun 20, 2024

oliverbock Jun 20, 2024

untracked: Allow user to specify more aggregation functions, include … #54

Are you sure you want to change the base?

untracked: Allow user to specify more aggregation functions, include … #54

Conversation

oliverbock commented Apr 15, 2024

oliverbock commented Jun 19, 2024

jrwishart commented Jun 20, 2024

jrwishart left a comment

Choose a reason for hiding this comment

jrwishart Jun 20, 2024

Choose a reason for hiding this comment

jrwishart Jun 20, 2024

Choose a reason for hiding this comment

oliverbock Jun 20, 2024

Choose a reason for hiding this comment

jrwishart Jun 20, 2024

Choose a reason for hiding this comment

oliverbock Jun 20, 2024

Choose a reason for hiding this comment