Support 'unknown' sdtype #1516

Closed
amontanez24 opened this issue Jul 27, 2023 · 0 comments · Fixed by #1532 or #1610
Labels
feature request Request for a new feature

Problem Description

The metadata updates in #1515 will add a new sdtype called unknown. This sdtype is assigned when no specific sdtype could be determined for a column, in which case a generic faker should be used.

Expected behavior

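  • When assigning transformers, if the sdtype is unknown, assign the following transformer:
from rdt.transformers import AnonymizedFaker

AnonymizedFaker(
    provider_name=None,
    function_name='bothify',
    # Arguments for Faker's bothify are passed through function_kwargs
    function_kwargs={
        'text': 'sdv-pii-?????',
        'letters': '0123456789abcdefghijklmnopqrstuvwxyz'
    }
)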
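
To make the expected output format concrete, here is a minimal standard-library sketch of what the '?' placeholders in the template expand to. It is a stand-in that only approximates Faker's bothify for '?' characters, not the library's implementation; the helper name `fill_placeholders` and the seeded generator are assumptions made for illustration.

```python
import random

# Alphabet and template come from the issue; the helper below is hypothetical.
LETTERS = '0123456789abcdefghijklmnopqrstuvwxyz'
TEMPLATE = 'sdv-pii-?????'


def fill_placeholders(text, letters, rng):
    """Replace each '?' in text with a random character drawn from letters."""
    return ''.join(rng.choice(letters) if ch == '?' else ch for ch in text)


rng = random.Random(0)  # seeded only so the sketch is repeatable
sample = fill_placeholders(TEMPLATE, LETTERS, rng)
print(sample)  # 'sdv-pii-' followed by five characters from LETTERS
```

Each generated value keeps the fixed sdv-pii- prefix and gets a five-character suffix, so there are 36^5 possible values per column.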

Additional context

  • The logic for assigning transformers can be found here. We will no longer need the case where we default to categorical transformers if all else fails, since every column should have either a specific sdtype or unknown.
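
The assignment rule described above could be sketched roughly as follows. This is not SDV's actual code: the function `assign_transformers`, the lookup table, and the placeholder transformer names are all hypothetical, and transformers are represented as plain data so the example runs without any dependencies.

```python
# Hypothetical plain-data description of the transformer proposed in the issue.
UNKNOWN_TRANSFORMER = {
    'transformer': 'AnonymizedFaker',
    'provider_name': None,
    'function_name': 'bothify',
    'text': 'sdv-pii-?????',
    'letters': '0123456789abcdefghijklmnopqrstuvwxyz',
}

# Hypothetical lookup table; the values are placeholder names, not real classes.
SDTYPE_TRANSFORMERS = {
    'numerical': 'numerical_transformer',
    'categorical': 'categorical_transformer',
}


def assign_transformers(column_sdtypes):
    """Map each column name to a transformer based on its sdtype."""
    assigned = {}
    for column, sdtype in column_sdtypes.items():
        if sdtype == 'unknown':
            # New case from the issue: unknown columns get the generic faker.
            assigned[column] = UNKNOWN_TRANSFORMER
        elif sdtype in SDTYPE_TRANSFORMERS:
            assigned[column] = SDTYPE_TRANSFORMERS[sdtype]
        else:
            # No categorical fallback: a missing or unsupported sdtype is an error.
            raise ValueError(f'No transformer for sdtype {sdtype!r} in column {column!r}')
    return assigned


result = assign_transformers({'age': 'numerical', 'notes': 'unknown'})
print(result['notes']['function_name'])
```

Raising on an unrecognized sdtype is one possible design choice here; the issue only states that the categorical fallback is no longer needed.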