-
-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to deal with mixed data ? #3
Comments
Thanks. You can specify the breakpoints via option
|
Thanks. very detailed answer. In my case, i mean the value '666' and '888' is a categorical variable. so we should convert it as a factor before woebin. special valuesp<-c(666,888) convert it to factordat_sp$x<-as.factor(dat_sp$x) bin for normal databins_nor <- woebin(dat_nor, "y") and now the question is 1) how to combine these two plot in one plot. 2) how to combine these two woe for the variable in this case becase we can't do it by rbind function simply. if we have many such variable , how to get woe ? what I really warried is that we can't do bin for many variables automatically. for example,many functions in your package support batch process, obviously,if we bin for special value and normal value respectively,it destroies batch process. |
get optimal breakpoints for rest datasetbins <- woebin(dat[x != 666 & x != 888], "y") |
It could be a problem if you have many variable that need to handle in this way. |
Thanks again for your nice solution! Looking forward to your improved version. |
see the following example: library(scorecard) dat <- data.frame(y=c(0,0,0,1,1,1,1,1,0,0,1,1,1,0,0,0,1,1,1,0), #' specify two values as two class |
great package on this subject! very nice job! here i have a problem.for example,I have a variable, such as ,dat<-data.frame(y=c(0,0,0,1,1,1,1,1,0,0,1,1,1,0,0,0,1,1,1,0),x=c(1,2,3,4,5,888,888,888,9,10,666,666,666,666,15,16,17,18,19,20)). In this case,i want regard '888' and '666'as special two class such as missing value have own woe, and i want to get two woe for '888' and '666' separately. other values are computed as usual. How to handle this type data. Thanks!
The text was updated successfully, but these errors were encountered: