tibble() should auto-splice unnamed tibble columns #581

hadley · 2019-02-21T22:17:11Z

For compatibility with proposed dplyr semantics

krlmlr · 2019-02-22T00:12:28Z

I don't understand auto-splicing. Should anything change here:

library(rlang)
library(tibble)

quos <- quos(a = 5)
tibble(!!!quos)
#> # A tibble: 1 x 1
#>       a
#>   <dbl>
#> 1     5
tibble(quos)
#> # A tibble: 1 x 1
#>   quos          
#>   <S3: quosures>
#> 1 ~5

^{Created on 2019-02-22 by the reprex package (v0.2.1.9000)}

lionel- · 2019-02-22T08:20:43Z

tibble(tibble(a = 1)) would be equivalent to tibble(a = 1).
tibble(A = tibble(a = 1)) would create a df-col.

The idea is that giving a name to a tibble namespaces it, i.e. creates a df-col. There were discussions about auto-splicing vs namespacing semantics in tidyverse/dplyr#3721, tidyverse/dplyr#4169 , tidyverse/dplyr#3967, and r-lib/tidyselect#86. I'll write a meta issue about it next week.

Implementation might be tricky. It involves turning off auto-labelling of quosures and manually labelling unnamed objects which are not tibbles at the right time. We now have rlang::as_label() for this.

hadley · 2019-03-04T13:41:30Z

as_tibble() would need the same treatment.

jennybc · 2019-03-04T15:59:33Z

I feel compelled to mention that having shape or type depend on the presence/absence of names has felt awkward in the past. I think that comes up in the long-running purrr::map_df[rc]() conversation as well.

krlmlr · 2019-06-29T23:55:57Z

This feels awkward. Users can always splice manually. What's the advantage of auto-splicing?

Thinking about namespaces in C++: an unnamed namespace is still a namespace and doesn't somehow get embedded into its parent. Auto-named tibble columns feel similar.

lionel- · 2019-07-01T12:56:03Z

Auto-splicing doesn't use complicated syntax and doesn't have problems of evaluation timing.

It will be important to have auto-splicing in dplyr to implement mapping():

data %>%
  mutate(
    bar = bar(),
    mapping(starts_with("foo"), ~ .x / sd(.x))
  )

If you splice, mapping() will be evaluated too early and won't get access to the tidyselect variables.

hadley · 2019-07-01T12:59:14Z

And we're trying it out with tibble() as it seems like a low-risk environment. If adding this behaviour to tibble() causes significant problems, we'll re-evaluate.

krlmlr · 2019-07-01T13:26:27Z

I'm not yet convinced the principal constructor for data frames in the tidyverse is a low-risk environment ;-)

I'm especially concerned about the handling of name conflicts. Even if we apply tidyverse rules, this may lead to very surprising behavior.

To reiterate the argument:

We are implementing (or planning to) adverbs like mapping() that work inside other verbs and generate a data frame. The net effect should be creation and updating of columns in the existing data frame:

tibble(foo1 = 1:3, foo2 = 2:4, bar = 5) %>%
  mutate(
    bar = 6,
    mapping(starts_with("foo"), ~ .x / sd(.x))
  )
## tibble(foo1 = as.numeric(1:3), foo2 = as.numeric(2:4), bar = 6)

I suppose the following should also work:

tibble(foo1 = 1:3, foo2 = 2:4, bar = 5) %>%
  mutate(
    bar = 6,
    tibble(foo1 = 2:4, foo2 = 3:5)
  )
## tibble(foo1 = 2:4, foo2 = 3:5, bar = 6)

For consistency, tibble() should behave identically.

Am I missing anything?

As an alternative, how about an auto-splicing adverb:

tibble(foo1 = 1:3, foo2 = 2:4, bar = 5) %>%
  mutate(
    bar = 6,
    auto_splice(tibble(foo1 = 2:4, foo2 = 3:5))
  ) %>%
  flatten_auto_splice()
## tibble(foo1 = 2:4, foo2 = 3:5, bar = 6)

auto_splice() would work by attaching a class to the object, which would be picked up by flatten_auto_splice() . Of course flatten_auto_splice() could become embedded into mutate() and tibble() .

lionel- · 2019-07-01T13:44:02Z

An auto_splice() adverb seems less elegant.

We have already started using the named/unnamed syntax in vec_cbind(), it wouldn't be convenient to use an adverb there.

hadley · 2019-07-01T14:37:27Z

It is low risk because I find it very hard to imagine that anyone is counting on the current behaviour:

library(tibble)
names(tibble(tibble(x = 1, y = 2)))
#> [1] "tibble(x = 1, y = 2)"

^{Created on 2019-07-01 by the reprex package (v0.3.0)}

krlmlr · 2019-08-07T15:16:41Z

There's also:

names(tibble::tibble(mtcars))
#> [1] "mtcars"

^{Created on 2019-08-07 by the reprex package (v0.3.0)}

Are we still considering auto-splice? I'm happy to implement, just double-checking.

github-actions · 2020-12-08T00:39:13Z

This old thread has been automatically locked. If you think you have found something related to this, please open a new issue and link to this old issue if necessary.

krlmlr modified the milestone: 2.x.y Feb 22, 2019

hadley mentioned this issue Mar 21, 2019

use of variable names vs base data.frame() #569

Closed

hadley added the vctrs ↗️ Requires vctrs package label May 29, 2019

krlmlr mentioned this issue Jun 24, 2019

Should tibble::tibble() use quos(..., .named = TRUE)? #613

Closed

krlmlr mentioned this issue Jul 1, 2019

Splicing during evaluation? #617

Closed

krlmlr mentioned this issue Oct 23, 2019

Automatically unpack unnamed df-cols tidyverse/dplyr#2326

Closed

krlmlr closed this as completed in e0bcae3 Oct 25, 2019

github-actions bot locked and limited conversation to collaborators Dec 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tibble() should auto-splice unnamed tibble columns #581

tibble() should auto-splice unnamed tibble columns #581

hadley commented Feb 21, 2019

krlmlr commented Feb 22, 2019

lionel- commented Feb 22, 2019

hadley commented Mar 4, 2019

jennybc commented Mar 4, 2019

krlmlr commented Jun 29, 2019

lionel- commented Jul 1, 2019

hadley commented Jul 1, 2019

krlmlr commented Jul 1, 2019

lionel- commented Jul 1, 2019

hadley commented Jul 1, 2019

krlmlr commented Aug 7, 2019

github-actions bot commented Dec 8, 2020

tibble() should auto-splice unnamed tibble columns #581

tibble() should auto-splice unnamed tibble columns #581

Comments

hadley commented Feb 21, 2019

krlmlr commented Feb 22, 2019

lionel- commented Feb 22, 2019

hadley commented Mar 4, 2019

jennybc commented Mar 4, 2019

krlmlr commented Jun 29, 2019

lionel- commented Jul 1, 2019

hadley commented Jul 1, 2019

krlmlr commented Jul 1, 2019

lionel- commented Jul 1, 2019

hadley commented Jul 1, 2019

krlmlr commented Aug 7, 2019

github-actions bot commented Dec 8, 2020