-
Notifications
You must be signed in to change notification settings - Fork 64
Add multi-table UniformSynthesizer #497
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add multi-table UniformSynthesizer #497
Conversation
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## feature_branch/mutli_table_benchmark #497 +/- ##
========================================================================
+ Coverage 76.04% 76.39% +0.35%
========================================================================
Files 30 30
Lines 2371 2411 +40
========================================================================
+ Hits 1803 1842 +39
- Misses 568 569 +1
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
84f1f71 to
6d659cf
Compare
fealho
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, I only left 1 question.
amontanez24
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thanks for the changes
| model.fit(data) | ||
|
|
||
| return model | ||
| self._internal_synthesizer = model |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we could just call it self._model
fealho
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left a couple of minor docstring comments, besides that it looks good to me 👍
f5abf35
into
feature_branch/mutli_table_benchmark
Resolve #485
CU-86b7chzve
In this PR:
1 - I introduce a
_fit()method in theBaseSynthesizerclass. This method is similar to what we do in SDV; it fits all synthesizer parameters that need to be learned from the data and metadata.2 - I refactor
get_trained_synthesizerso it always returns an SDGym synthesizer object.3 - For SDV and third-party synthesizers, I defined the
_internal_synthesizerattribute, which references the actual SDV or third-party model that the SDGym synthesizer wraps around.4 - Define a
MultiTableBaselineSynthesizerto use thescaleparameter when sampling data.@amontanez24, @pvk-developer, @fealho, I ran an AWS test today from this branch with Variant to ensure everything still works as expected with these changes. The result is here