🦙 Missingness encoder is here #7
The fields of `fitted_params(mach)` are:
- `new_cat_given_col_val`: A dictionary that for each column, maps `missing` into some value according to `label_for_missing`
Can we call this `label_for_missing_given_feature` or similar? See my comments in the contrast encoding PR.
Done.
Looking good, thank you. 🦾
Co-authored-by: Anthony Blaom, PhD <anthony.blaom@gmail.com>
Addressed all points and will merge. Thank you so much for the nice review.
This PR introduces the missingness encoder. Its implementation and testing framework closely follow those of the cardinality reducer.
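For readers unfamiliar with the idea, here is a minimal standalone sketch in Python of what a missingness encoder might do. It is not the package's Julia implementation. The function names `fit_missingness` and `transform_missingness`, the use of `None` to stand in for `missing`, and the choice to pick a label by the type of the first non-missing entry are all assumptions made for illustration. Only the names `label_for_missing` and `label_for_missing_given_feature` come from the discussion above.

```python
from typing import Any, Dict, List, Optional

Column = List[Optional[Any]]


def fit_missingness(
    columns: Dict[str, Column],
    label_for_missing: Dict[type, Any],
) -> Dict[str, Any]:
    """Learn, for each column, the label that will replace missing entries.

    Hypothetical helper: the label is looked up by the type of the first
    non-missing value in the column. Columns that are entirely missing, or
    whose type has no entry in `label_for_missing`, are skipped.
    """
    label_for_missing_given_feature: Dict[str, Any] = {}
    for name, values in columns.items():
        present = [v for v in values if v is not None]
        if not present:
            continue
        elem_type = type(present[0])
        if elem_type in label_for_missing:
            label_for_missing_given_feature[name] = label_for_missing[elem_type]
    return label_for_missing_given_feature


def transform_missingness(
    columns: Dict[str, Column],
    label_for_missing_given_feature: Dict[str, Any],
) -> Dict[str, Column]:
    """Replace None with the learned label; other columns pass through."""
    encoded: Dict[str, Column] = {}
    for name, values in columns.items():
        if name in label_for_missing_given_feature:
            label = label_for_missing_given_feature[name]
            encoded[name] = [label if v is None else v for v in values]
        else:
            encoded[name] = list(values)
    return encoded


columns = {"color": ["red", None, "blue"], "grade": [1, 2, None]}
fitted = fit_missingness(columns, {str: "missing", int: -1})
encoded = transform_missingness(columns, fitted)
```

Keeping the learned mapping separate from the transform step mirrors the fit/transform split that `fitted_params(mach)` exposes: the dictionary is computed once and can then be inspected or reused on new data.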