-
Notifications
You must be signed in to change notification settings - Fork 316
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Numerical unknowns should not be converted to sdv-pii-???? #2089
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Just checking, it it an issue for HMA only? @lajohn4747, @amontanez24
It appears to be that way after some local testing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think we should force the reversed type to be object. The reversed type should match whatever the input was. In the case where the original type was numerical, I don't think we should be adding that prefix
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we also add an integration test with the example in the issue
resolves #2064
CU-86b0wh849
PII Type adds the prefix
sdv-pii
. If the sdtype is unknown, the transformers will auto-assign but the reverse transform for apii
sdtype should always have the dtype ofobject
as it will contain a prefix. If a numerical column is detected we will add change theFaker
function to usenumerify
and cap at the max amount of digits.Adjusted some unit test to avoid test failures due to mocking