-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
step-na-indicate #617
step-na-indicate #617
Conversation
Looks good! Some minor notes in-line. |
@topepo - what do you think about this proposal from my first message?
For example: right now if somebody called |
I don't want to make selectors that use the missing value pattern to select predictors, if that's what you mean. |
I think in 99% of the cases, we would not want to remove the original columns. Both are needed for the type of data analysis that people would do with these missing indicators. |
@topepo I've worked on the points you mentioned and added some basic tests. However, there's a warning showing up now (likely related to Warning message:
All elements of `...` must be named.
Did you want `data = c(type, role, source)`?
Ok, got you here. I both agree and disagree with that approach, but I guess that not having done that automatically will reduce unwanted/ unexpected surprises. |
That message is from a |
Also, can you update from master (to avoid some of the documentation conflicts)? |
─ Session info ───────────────────────────────────────────────────────────────────────────────────────
setting value
version R version 3.6.1 (2019-07-05)
os macOS Catalina 10.15.3
system x86_64, darwin15.6.0
ui RStudio
language (EN)
collate en_US.UTF-8
ctype en_US.UTF-8
tz Europe/Berlin
date 2020-12-17
─ Packages ───────────────────────────────────────────────────────────────────────────────────────────
! package * version date lib source
assertthat 0.2.1 2019-03-21 [1] CRAN (R 3.6.0)
backports 1.1.7 2020-05-13 [1] CRAN (R 3.6.2)
callr 3.4.3 2020-03-28 [1] CRAN (R 3.6.2)
class 7.3-15 2019-01-01 [1] CRAN (R 3.6.1)
cli 2.2.0 2020-11-20 [1] CRAN (R 3.6.2)
crayon 1.3.4 2017-09-16 [1] CRAN (R 3.6.0)
desc 1.2.0 2018-05-01 [1] CRAN (R 3.6.0)
devtools 2.2.1 2019-09-24 [1] CRAN (R 3.6.0)
digest 0.6.25 2020-02-23 [1] CRAN (R 3.6.0)
dplyr * 1.0.2 2020-08-18 [1] CRAN (R 3.6.2)
ellipsis 0.3.1 2020-05-15 [1] CRAN (R 3.6.2)
fansi 0.4.1 2020-01-08 [1] CRAN (R 3.6.1)
fs 1.4.1 2020-04-04 [1] CRAN (R 3.6.2)
generics 0.1.0 2020-10-31 [1] CRAN (R 3.6.2)
glue 1.4.1 2020-05-13 [1] CRAN (R 3.6.2)
gower 0.2.1 2019-05-14 [1] CRAN (R 3.6.0)
ipred 0.9-9 2019-04-28 [1] CRAN (R 3.6.0)
lattice 0.20-38 2018-11-04 [1] CRAN (R 3.6.1)
lava 1.6.7 2020-03-05 [1] CRAN (R 3.6.0)
lifecycle 0.2.0 2020-03-06 [1] CRAN (R 3.6.0)
lubridate 1.7.8 2020-04-06 [1] CRAN (R 3.6.2)
magrittr 2.0.1 2020-11-17 [1] CRAN (R 3.6.2)
MASS 7.3-51.4 2019-03-31 [1] CRAN (R 3.6.1)
Matrix 1.2-17 2019-03-22 [1] CRAN (R 3.6.1)
memoise 1.1.0 2017-04-21 [1] CRAN (R 3.6.0)
nnet 7.3-12 2016-02-02 [1] CRAN (R 3.6.1)
packrat 0.5.0 2018-11-14 [1] CRAN (R 3.6.0)
pillar 1.4.4 2020-05-05 [1] CRAN (R 3.6.2)
pkgbuild 1.0.8 2020-05-07 [1] CRAN (R 3.6.2)
pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 3.6.0)
pkgload 1.1.0 2020-05-29 [1] CRAN (R 3.6.2)
prettyunits 1.1.1 2020-01-24 [1] CRAN (R 3.6.1)
processx 3.4.2 2020-02-09 [1] CRAN (R 3.6.0)
prodlim 2019.11.13 2019-11-17 [1] CRAN (R 3.6.0)
ps 1.3.3 2020-05-08 [1] CRAN (R 3.6.2)
purrr 0.3.4 2020-04-17 [1] CRAN (R 3.6.2)
R6 2.4.1 2019-11-12 [1] CRAN (R 3.6.0)
Rcpp 1.0.4 2020-03-17 [1] CRAN (R 3.6.0)
P recipes * 0.1.6.9000 2020-12-14 [?] local
remotes 2.1.0 2019-06-24 [1] CRAN (R 3.6.0)
rlang 0.4.9 2020-11-26 [1] CRAN (R 3.6.2)
rpart 4.1-15 2019-04-12 [1] CRAN (R 3.6.0)
rprojroot 1.3-2 2018-01-03 [1] CRAN (R 3.6.0)
rstudioapi 0.13 2020-11-12 [1] CRAN (R 3.6.2)
sessioninfo 1.1.1 2018-11-05 [1] CRAN (R 3.6.0)
survival 2.44-1.1 2019-04-01 [1] CRAN (R 3.6.1)
testthat * 2.3.2 2020-03-02 [1] CRAN (R 3.6.0)
tibble 3.0.4 2020-10-12 [1] CRAN (R 3.6.2)
tidyr 1.1.2 2020-08-27 [1] CRAN (R 3.6.2)
tidyselect 1.1.0 2020-05-11 [1] CRAN (R 3.6.2)
timeDate 3043.102 2018-02-21 [1] CRAN (R 3.6.0)
usethis 1.5.1 2019-07-04 [1] CRAN (R 3.6.0)
vctrs 0.3.5 2020-11-17 [1] CRAN (R 3.6.2)
withr 2.2.0 2020-04-20 [1] CRAN (R 3.6.2)
[1] /Library/Frameworks/R.framework/Versions/3.6/Resources/library
P ── Loaded and on-disk path mismatch. |
Hmm, I'm not sure why these error are showing up... I've checked that I was up-to-date with master. |
This is looking really close @konradsemsch! 🙌 It seems like this branch was created from the main branch of recipes at a point is the "distant" past 🦖 👵 and some very old files have gotten included in the PR, which is causing some problems for us in installing the PR and testing it out. Such a bummer! Can we ask you to help us out in creating a more streamlined PR that will be easier to test and check? Here are the steps we think you should take:
|
Thank you @juliasilge for explaining! Indeed something was mixed up over there, but now we can start clean :) Things are looking good on the new PR: here. Closing this one. |
This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with reprex) and link to this issue. https://reprex.tidyverse.org |
Hi @juliasilge!
I've fitted my earlier code into the current implementation of
recipes
! As of now the implementation is pretty basic and I was thinking also of adding one more functionality that I wanted to check with you: creating indicator variables only for those features that had missing values/ certain prop of missing values in the first place. What do you think? How to make it fit in the entirerecipes
workflow? Would you have any other recommendations/ ideas?On my list are also some tests, but wanted to agree on a baseline implementation first :) Let me know!