Documentation and general cleanup for #493, #495 #496

yjunechoe · 2023-10-29T15:55:39Z

Summary

This will be a series of (mostly) documentation-related changes complementing the two recent PRs:

A complete tidyselect integration for the columns argument in validation functions #493
Enable glue syntax in label to access current column/segment (and possibly others) #495

Below is the (evolving) roadmap for this PR. Please feel free to interject at any point!

1) Chore

Remove the usage of vars() for column selection in docs. vars(a) becomes a and vars(a, b) becomes c(a, b).
Clean up how the user's expression in columns is captured in validation functions: the input should be enquo()-ed only once
Remove/refactor some utilities no longer used:
- as_vars_fn() (now as_c_fn())
Note any gaps in the newly introduced features and add them to (3)
Catch and fix any bugs spotted in the process
- resolving c()-expr column selection inside serially() is too eager
- serially() YAML section example doesn't run: "Error: There must be at least one test_*() function call in serially()."
- col_exists() allows all kinds of evaluation errors to go through, not just user-specified selection of non-existent columns
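
As a rough illustration of the vars() replacement described in this section, here is a minimal sketch. It assumes a pointblank version that includes the tidyselect integration from #493, and it uses the small_table dataset bundled with the package; the object names agent_vars and agent_c are made up for the example.

```r
library(pointblank)

# Older style: columns wrapped in vars()
agent_vars <- create_agent(tbl = small_table) %>%
  col_vals_gt(columns = vars(a, d), value = 0)

# Style used in the updated docs: a bare name for a single column,
# and c() for several columns
agent_c <- create_agent(tbl = small_table) %>%
  col_vals_gt(columns = a, value = 0) %>%
  col_vals_gt(columns = c(a, d), value = 0)
```

Both forms should expand to one validation step per selected column; only the way the selection is written changes.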

2) Documentation

Document tidyselect in columns param
Document (more complete) tidyselect in Column Names section
Document glue syntax in label param
Document multi-length vector support in label param

Copy pastes:

Make special columns param docs for rows_*() functions, making everything() the default
Edit Column Names section in individual function docs
Add Labels section in individual function docs
- Only show the relevant subset of glue variables for each function
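
For reference, the glue syntax in label that these docs cover can be sketched as below. This assumes a pointblank version that includes #495, where {.col} is understood to stand for the column being validated in each step; the label text itself is just an example.

```r
library(pointblank)

# One label template, expanded once per selected column
agent <- create_agent(tbl = small_table) %>%
  col_vals_not_null(
    columns = c(a, b),
    label = "No missing values in {.col}"
  )

# Inspect the labels stored for each validation step
agent$validation_set$label
```

Because the template is filled in per column, a single label argument can describe every step that a multi-column selection expands into.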

3) Feature completeness (more of a wishlist)

refactor get_column_text() for writing columns expr to yaml
yaml functions should now write columns as c(a) instead of vars(a) if user only specifies columns = a
- test consumption of columns: c(a) and columns: c(a, b)
The rest are too big or out of scope and have been moved to section X

4) Finishing touches

News items

glue syntax in label
more complete tidyselect in columns
note beginning of deprecation process for vars() in columns (?)
encourage users to wrap the input to columns in all_of() if it's an external vector (like in dplyr::select())
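
The all_of() recommendation in the last news item could look something like the following sketch. It assumes the tidyselect integration from #493 is available; cols_to_check is a hypothetical external character vector, and all_of() is called with an explicit tidyselect:: prefix so the example does not depend on which helpers are attached.

```r
library(pointblank)

# Column names stored outside the validation call
cols_to_check <- c("a", "d")

# Wrap the external vector in all_of(), as in dplyr::select()
agent <- create_agent(tbl = small_table) %>%
  col_vals_gt(columns = tidyselect::all_of(cols_to_check), value = 0)
```

Wrapping the vector makes it explicit that cols_to_check is a variable from the calling environment rather than a column of the table.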

Additional tests

test examples embedded in sections
ensure all tests pass
ensure pkgdown builds

X) Bigger refactoring tasks that should be handled outside of this PR

yaml should default to reading/writing columns as expression
Remove/refactor some utilities no longer used:
- uses_tidyselect()
- exported_tidyselect_fns()
has_columns() currently relies on columns = vars(...) and could benefit from tidyselect (same situation as col_exists(), just more lightweight)
info_columns() does not allow tidyselect expressions over column type/values like where(is.character) when tbl is loaded lazily.

Related GitHub Issues and PRs

Ref: A complete tidyselect integration for the columns argument in validation functions #493, Enable glue syntax in label to access current column/segment (and possibly others) #495

Checklist

I understand and agree to the Code of Conduct.
I have listed any major changes in the NEWS.
I have added testthat unit tests to tests/testthat for any new functionality.

yjunechoe · 2023-10-30T20:35:22Z

Thanks for the detailed comments on columns = NULL! I've moved it over to #497 to tackle this separately since it (thankfully) doesn't derail the current direction of this PR.

I've also now made everything() visible as the default value of columns for rows_*() functions.

args(rows_distinct)
#> function (x, columns = tidyselect::everything(), preconditions = NULL, 
#>     segments = NULL, actions = NULL, step_id = NULL, label = NULL, 
#>     brief = NULL, active = TRUE)

Back to copy-pasting!

rich-iannone · 2023-10-30T21:00:05Z

It's really a lot of files! I sometimes regret that choice.

yjunechoe · 2023-10-30T23:18:01Z

I think this more or less covers the necessary doc changes w.r.t. columns and label! I've also removed references to vars() in columns and replaced them with c() (or just mention tidyselect) as much as possible. I'll follow up on the remaining refactoring/bugfix tasks elsewhere.

(Also - any clue about the error in codecov GHA? It runs fine locally and I can't seem to pinpoint the failing test)

rich-iannone

LGTM!!

rich-iannone · 2023-10-31T04:22:19Z

That was a huge amount of work but these are super good changes!

I watched the codecov workflow run and couldn’t get any info about why the tests fail in that workflow specifically. It’s something that could be looked at later.

As for this PR, all approved. Feel free to merge this in at your leisure!

yjunechoe · 2023-10-31T12:44:58Z

Thanks!

Moving codecov debugging to #498

yjunechoe added 4 commits October 29, 2023 10:09

bug fix tidyselect for validation steps inside serially()

f25e659

first pass of vars()-stripping in examples

7f91c61

fix example typo

10bfbae

document()

2f1046d

yjunechoe marked this pull request as draft October 29, 2023 15:55

yjunechoe added 25 commits October 29, 2023 12:15

fix serially() example in yaml section

2f603fe

allow passing dots down into eval_select()

7494a42

fix bug in col_exists() allowing any arbitrary errors in column selection

908a6ae

test various column selection failure behaviors of col_exists()

050f89d

tidyselect 0-column selection in col_exists() should fail gracefully

6a59cec

bring back old behavior of error when no columns provided to `col_exists()`

9dacc71

resolve_columns() passes down validation call context

e3f9675

fix typo in test

2e889dc

lintr

72a030e

clean up yaml_agent_string() mutually exclusive arg logic

ee61368

default to c()-expr when writing columns to yaml

bde99bb

read c()-expr from yaml as language not character

644c948

change some yaml tests to expect writing to c()

751d06a

test columns c()-expr roundtrip

44d510e

test defaulting to c() for wrapping columns in yaml

f42385a

remove vars() from yaml section in docs

895eee9

more cleanup of vars() in docs

a0d696d

document()

98d2718

enquo() columns only once

07c6636

document generic glue and multi-length vector support in label

22e5b23

point to Label section for more info

883078c

give tidy-select to columns argument signature and reference dplyr select()

12002d0

update Column Names section

56d82ff

add Labels section

dbd424d

wording

2748a5c

yjunechoe added 5 commits October 30, 2023 14:10

keep e.g. style

701ce84

explicit everything() default for columns arg in formals

045aa70

repeat for expect and test

aefc633

separate out yaml tests for columns

f787627

prune NULL=everything() code

7bfa3e7

yjunechoe added 3 commits October 30, 2023 16:02

rows*() functions write column exprs to yaml

a3db90a

edit tests to expect column exprs from rows* functions

da379f9

test everything() round-tripping

559be0c

yjunechoe mentioned this pull request Oct 30, 2023

Better handling for bad column selections #497

Closed

yjunechoe added 2 commits October 30, 2023 16:31

document()

8ae7d5a

document everything() default for columns

dfad342

yjunechoe added 7 commits October 30, 2023 17:16

update column names section

7b6029f

document()

de6ec55

col_exists inherits tidyselect column signature

732ef04

add Labels section to individual functions

7bb6050

document()

43356ec

add NEWS item for tidyselect in columns

f5a6225

remove reference to vars()

51099b9

yjunechoe marked this pull request as ready for review October 30, 2023 23:18

rich-iannone approved these changes Oct 31, 2023

yjunechoe merged commit ed6f948 into rstudio:main Oct 31, 2023
12 of 13 checks passed

yjunechoe deleted the tidyselect-coverage-cleanup branch October 31, 2023 12:44

yjunechoe mentioned this pull request Feb 29, 2024

Release pointblank 0.12.0 #522

Closed

18 tasks
