Fill downup updown #504

coolbutuseless · 2018-10-16T06:48:39Z

Add option to fill() to both fill-down-then-up and fill-up-then-down.

This is to replace a common idiom of mine, i.e.

df %>%
  group_by(group) %>%
  tidyr::fill(value, .direction = 'down') %>%
  tidyr::fill(value, .direction = 'up') %>%
  ungroup()

which could become

df %>%
  group_by(group) %>%
  tidyr::fill(value, .direction = 'downup') %>%
  ungroup()

Depending upon number of groups and number of variables to replace, the current duplicate call to fill() can be avoided, giving significant speed savings.

hadley · 2018-10-23T19:18:18Z

Can you explain why you'd want to do this?

* Update lazyeval compat file * Unquote scalar quosure with !! * Use as_string(ensym()) rather than quo_name(enquo()) This is a much more robust way of capturing symbols

* Add uncount to ref index * Build reference index w/ parens * Fixes #480

Fixes #397

coolbutuseless · 2018-10-24T10:04:27Z

A situation where I do this

I have values only known at some particular time and I need to fill this value both forwards and backwards in time.

A particular example

I work with clinical trial data, which is often provided in multiple files.

In the process of making a data set for analysis, particular information may only be recorded at certain events/times, but need to be filled forward/back in time throughout a related time period.

It is only valid to fill up/down within certain groupings (e.g. subjects, day, part of study) - with lots of subjects and lots of groups, this filling can take a noticeable amount of time.

Also filling may be done within different groupings for different variables.

A simplified concrete example:

suppressPackageStartupMessages({
  library(dplyr)
})

# Weight only recorded at event_type = 1, but considered
# valid across the entire event_num.
# If 'wt' not defined for a given event num, it may be
# carried forwards from a prior run, or backwards from a following run
df <- tibble::tribble(
  ~subject, ~time, ~event_type, ~event_num,  ~wt,
  1       ,     1,           0,          1,   NA,
  1       ,     2,           0,          1,   NA,
  1       ,     3,           1,          1,   20,
  1       ,     4,           0,          1,   NA,
  1       ,     5,           0,          1,   NA,
  1       ,     1,           0,          2,   NA,
  1       ,     2,           0,          2,   NA,
  1       ,     3,           1,          2,   NA,
  1       ,     4,           0,          2,   NA,
  1       ,     5,           0,          2,   NA,
  1       ,     1,           0,          3,   NA,
  1       ,     2,           0,          3,   NA,
  1       ,     3,           1,          3,   30,
  1       ,     4,           0,          3,   NA,
  1       ,     5,           0,          3,   NA,
)

# fill wt down/up within the event_num for each subject,
# then down/up within subject only.
df %>%
  group_by(subject, event_num) %>%
  tidyr::fill(wt, .direction = 'down') %>%
  tidyr::fill(wt, .direction = 'up'  ) %>%
  group_by(subject) %>%
  tidyr::fill(wt, .direction = 'down') %>%
  tidyr::fill(wt, .direction = 'up'  ) %>%
  ungroup()
#> # A tibble: 15 x 5
#>    subject  time event_type event_num    wt
#>      <dbl> <dbl>      <dbl>     <dbl> <dbl>
#>  1       1     1          0         1    20
#>  2       1     2          0         1    20
#>  3       1     3          1         1    20
#>  4       1     4          0         1    20
#>  5       1     5          0         1    20
#>  6       1     1          0         2    20
#>  7       1     2          0         2    20
#>  8       1     3          1         2    20
#>  9       1     4          0         2    20
#> 10       1     5          0         2    20
#> 11       1     1          0         3    30
#> 12       1     2          0         3    30
#> 13       1     3          1         3    30
#> 14       1     4          0         3    30
#> 15       1     5          0         3    30

Created on 2018-10-24 by the reprex
package (v0.2.0).

* Add missing tests for spread with fill * Remove duplicate test for gather. The test below the removed one, with the same name, covers exactly the same test cases (and more) * Add missing test for id with high dimension * Use tibble in test-spread.r * Ensure /tests is lint-free * Resolve conflict and small style points

To quiet glue deprecation message

Part of #512 See r-lib/rlang#675

echasnovski · 2018-10-26T10:45:09Z

I also had several occasions where this type of functionality would be useful. However, I'd phrase them slightly differently: replace missing values based on the closest row (by some column, usually time). In case of an equal distance, use direction argument.

If data is ordered by reference column then these downup and updown solve this problem.

Also url in order to generate CNAME

* Add CODE_OF_CONDUCT, CONTRIBUTING, ISSUE_TEMPLATE, and SUPPORT

Including creating new low-level expand_grid(). Is likely to have some changes to revdeps, but they should be because we're making tidyr more consistent with other tidyverse functions so they should be worth it. Fixes #557. Fixes #490.

Initial implementation of `pivot()` as replacement for `gather()` and `spread()`. Initial documentation and a vignette, but still needs much polishing.

Fixes #497

Part of #472

Fixes #478

Part of #496

…into fill-downup-updown

coolbutuseless · 2019-03-04T09:50:47Z

OK. I think i totally hosed this PR by trying to sync it with current master. :/

Burn to the ground and start again? I can't see a solution...

…into fill-downup-updown

hadley · 2019-03-04T13:11:30Z

In the future, you might try usethis::pr_pull_source() which should do the right thing to get your branch synced back with master.

coolbutuseless added 2 commits October 16, 2018 16:39

add 'downup' and 'updown' options for fill

5adbebd

update callsignatures for lazyeval versions of fill

41d57b9

coolbutuseless mentioned this pull request Oct 16, 2018

More fill() options - fill-down-then-up, and fill-up-then-down #505

Closed

fix function order

c448525

hadley and others added 9 commits October 23, 2018 14:18

Update roxygen2

acba1ad

Update lazyeval compat file (#495)

3e6dcab

* Update lazyeval compat file * Unquote scalar quosure with !! * Use as_string(ensym()) rather than quo_name(enquo()) This is a much more robust way of capturing symbols

'column names' to 'column types' (#488)

c1157f5

Change dplyr::tbl_df() to tibble::as_tibble() (#484)

a1cd63e

Build reference index w/ parens (#481)

57f1f89

* Add uncount to ref index * Build reference index w/ parens * Fixes #480

Link recoding functions together (#462)

a768bec

Omit columns named NA from separate result (#445)

1481fa4

Fixes #397

Doc tweaks

1cf2294

Use new build matrix

fbdeef2

rbloehm and others added 6 commits October 24, 2018 07:35

Bump tidyselect dependency

1d84a5b

To quiet glue deprecation message

Update news

7c15885

Re-run revdeps

3e14237

Use variant of ensym() that supports quosures (#513)

c318c31

Part of #512 See r-lib/rlang#675

Update cran comments post @lionel- revdep checks

61b77f0

hadley and others added 8 commits October 26, 2018 16:10

Prepare for release

ea27944

Automate site building

6277fb7

Remove extra release check

a07279f

Remove site so it can be auto-built

8a42d07

Also url in order to generate CNAME

Increment version number

4896b92

Back out of auto mode for now

6c0e510

Add use_tidy_github() guides (#516)

21cfd1b

* Add CODE_OF_CONDUCT, CONTRIBUTING, ISSUE_TEMPLATE, and SUPPORT

Don't import questioning or deprecated purrr functions

28af2f1

hadley and others added 19 commits March 2, 2019 09:21

Expand and friends rewrite (#561)

0f31248

Including creating new low-level expand_grid(). Is likely to have some changes to revdeps, but they should be because we're making tidyr more consistent with other tidyverse functions so they should be worth it. Fixes #557. Fixes #490.

Pivot (#564)

51f9cb1

Initial implementation of `pivot()` as replacement for `gather()` and `spread()`. Initial documentation and a vignette, but still needs much polishing.

Add widening pivots to vignette

9e8a869

Add pivotting example that needs new key variable

438dac6

Fixes #497

Only consider unique column names

dbfdc7b

Part of #472

Note what issues pivot() closes

a109650

Add test for renaming

29cab54

Fixes #478

Correctly interleave values

f0fad62

Part of #496

Update NEWS for name conflicts

5ce48b5

add 'downup' and 'updown' options for fill

c0c65ec

update callsignatures for lazyeval versions of fill

356794b

fix function order

4675728

change from compose() to anonymous function

956202d

Merge branch 'fill-downup-updown' of github.com:coolbutuseless/tidyr …

eb57216

…into fill-downup-updown

add 'downup' and 'updown' options for fill

86c324c

update callsignatures for lazyeval versions of fill

ea8e4a7

fix function order

d98e2a8

change from compose() to anonymous function

d54921e

Merge branch 'fill-downup-updown' of github.com:coolbutuseless/tidyr …

837356f

…into fill-downup-updown

coolbutuseless added 5 commits March 4, 2019 19:52

add 'downup' and 'updown' options for fill

6cfefa5

update callsignatures for lazyeval versions of fill

c9d5a06

fix function order

039829d

change from compose() to anonymous function

a820ce6

Merge branch 'fill-downup-updown' of github.com:coolbutuseless/tidyr …

145374b

…into fill-downup-updown

coolbutuseless closed this Mar 4, 2019

coolbutuseless deleted the fill-downup-updown branch March 4, 2019 09:58

coolbutuseless added a commit to coolbutuseless/tidyr that referenced this pull request Mar 4, 2019

Update to PR tidyverse#504. Use anonymous function instead of compose

74d5f71

coolbutuseless mentioned this pull request Mar 4, 2019

Update to PR #504. Use anonymous function instead of compose #567

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fill downup updown #504

Fill downup updown #504

coolbutuseless commented Oct 16, 2018 •

edited

hadley commented Oct 23, 2018

coolbutuseless commented Oct 24, 2018

echasnovski commented Oct 26, 2018

coolbutuseless commented Mar 4, 2019

hadley commented Mar 4, 2019

Fill downup updown #504

Fill downup updown #504

Conversation

coolbutuseless commented Oct 16, 2018 • edited

hadley commented Oct 23, 2018

coolbutuseless commented Oct 24, 2018

A situation where I do this

A particular example

A simplified concrete example:

echasnovski commented Oct 26, 2018

coolbutuseless commented Mar 4, 2019

hadley commented Mar 4, 2019

coolbutuseless commented Oct 16, 2018 •

edited