Simplify group context detection #6552

DavisVaughan · 2022-11-18T18:56:20Z

We now detect whether or not we need to add group context by checking if the mask$get_current_group() value is 0 or not. If it is 0, we are either before or after the actual computation of the group results. Otherwise we must be inside some kind of group computation, or we have hardcoded the group using mask$set_current_group(), which we do in a few places.

The most important change was probably resetting the group index to 0 in DPLYR_MASK_FINALISE() so that it is 0 after the group computations too.

This allowed me to remove the special error wrapping in pick() and in the across() deprecation.

R/across.R

DavisVaughan · 2022-11-18T18:57:31Z

R/conditions.R

-    "dplyr:::error_incompatible_combine",
-    "dplyr:::mutate_mixed_null",
-    "dplyr:::mutate_constant_recycle_error",
-    "dplyr:::summarise_mixed_null",


These 4 error classes are still needed to dispatch off for things like error bullet generation, but we no longer need to special case them here

R/conditions.R

R/pick.R

DavisVaughan · 2022-11-18T19:00:56Z

src/summarise.cpp

-    dplyr::stop_summarise_mixed_null();
+    const SEXP* p_chunks = VECTOR_PTR_RO(chunks);
+
+    for (R_xlen_t i = 0; i < ngroups; i++) {
+      if (p_chunks[i] == R_NilValue) {
+        // Find out the first time the group was `NULL`
+        // so that the error will be associated with this group
+        DPLYR_MASK_SET_GROUP(i);
+        dplyr::stop_summarise_mixed_null();
+      }
+    }


This is the error path that occurs when you happen to return a NULL from one group and a non-NULL value in another group

In mutate() we already do something similar (report the group with the first problematic NULL), so this is just us being consistent with that now

DavisVaughan · 2022-11-18T19:02:46Z

tests/testthat/_snaps/across.md

      i In argument `..1 = (if_any())`.
+      i In group 1: `g = 1`.


Minor improvement

We used to always skip adding the group context if we saw a warning_across_missing_cols_deprecated warning class. But during the evaluation path (i.e. not expansion) we actually do have the group context, so we can report it

DavisVaughan · 2022-11-18T19:04:29Z

tests/testthat/_snaps/mutate.md

@@ -164,6 +164,18 @@
      <error/dplyr:::mutate_error>
      Error in `mutate()`:
      i In argument: `..1 = if (a == 1) NULL else "foo"`.
+      i In group 1: `a = 1`.


Another small improvement

Again, we used to always skip adding the group context if mutate_mixed_null was the error class, but we actually do set the group value explicitly in the C++ code for this case, and we can now let that bubble up

DavisVaughan · 2022-11-18T19:05:37Z

tests/testthat/test-across.R

+  times_two <- function(x) x * 2
+
  # Expansion path
-  expect_snapshot(out <- mutate(df, z = across()))
-  expect_identical(out$z, df)
-  expect_snapshot(out <- mutate(gdf, z = across()))
-  expect_identical(out$z, df[c("x", "y")])
+  expect_snapshot(out <- mutate(df, across(.fns = times_two)))


I realized this wasn't actually doing the "expansion path" of across() because it was a named call to across(), i.e. the z = , so I fixed that here. It ends up tweaking the warning in the snapshot test a little

lionel-

Nice improvement!

R/across.R

lionel- · 2022-11-21T08:23:30Z

R/conditions.R

-    "dplyr:::error_incompatible_combine",
-    "dplyr:::mutate_mixed_null",
-    "dplyr:::mutate_constant_recycle_error",
-    "dplyr:::summarise_mixed_null",


src/summarise.cpp

tests/testthat/_snaps/summarise.md

Previously we never actually used the expansion path because named across calls went through the evaluation path

It is `0L` before any group evaluation, and we now ensure that it is `0L` on the way out too, through `DPLYR_MASK_FINALISE()` This ensures that we can continue to use `$set_current_group()` on the fly right before an `abort()` call to force a group location

We already do this for `mutate()`, so this is just us being consistent

DavisVaughan commented Nov 18, 2022

View reviewed changes

DavisVaughan requested a review from lionel- November 18, 2022 19:06

lionel- approved these changes Nov 21, 2022

View reviewed changes

DavisVaughan force-pushed the fix/group-context-errors branch 2 times, most recently from cf27e06 to ac57d3d Compare November 21, 2022 13:59

DavisVaughan added 9 commits November 21, 2022 09:24

Use more accurate across() test

012057c

Previously we never actually used the expansion path because named across calls went through the evaluation path

Add another group specific mutate() with mixed nulls test

8ac9292

Add another group specific summarise() with mixed nulls test

c9f835d

Report the group of the first mixed-NULL issue in summarise()

f4d06a1

We already do this for `mutate()`, so this is just us being consistent

Remove eval_select() error wrapping

dea4e3a

Remove across() deprecation wrapping

397e998

Add test specifically for tidyverse#6534

48562da

Use up to date naming scheme for new code

c79e5ac

DavisVaughan force-pushed the fix/group-context-errors branch from ac57d3d to c79e5ac Compare November 21, 2022 14:26

DavisVaughan merged commit 370fdf0 into tidyverse:main Nov 21, 2022

DavisVaughan deleted the fix/group-context-errors branch November 21, 2022 14:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify group context detection #6552

Simplify group context detection #6552

DavisVaughan commented Nov 18, 2022

DavisVaughan Nov 18, 2022

lionel- Nov 21, 2022

DavisVaughan Nov 18, 2022

DavisVaughan Nov 18, 2022

DavisVaughan Nov 18, 2022

DavisVaughan Nov 18, 2022

lionel- left a comment

lionel- Nov 21, 2022

Simplify group context detection #6552

Simplify group context detection #6552

Conversation

DavisVaughan commented Nov 18, 2022

DavisVaughan Nov 18, 2022

Choose a reason for hiding this comment

lionel- Nov 21, 2022

Choose a reason for hiding this comment

DavisVaughan Nov 18, 2022

Choose a reason for hiding this comment

DavisVaughan Nov 18, 2022

Choose a reason for hiding this comment

DavisVaughan Nov 18, 2022

Choose a reason for hiding this comment

DavisVaughan Nov 18, 2022

Choose a reason for hiding this comment

lionel- left a comment

Choose a reason for hiding this comment

lionel- Nov 21, 2022

Choose a reason for hiding this comment