Skip to content

Conversation

@R-Palazzo
Copy link
Contributor

@R-Palazzo R-Palazzo commented Nov 17, 2025

Resolve #485
CU-86b7chzve

In this PR:
1 - I introduce a _fit() method in the BaseSynthesizer class. This method is similar to what we do in SDV; it fits all synthesizer parameters that need to be learned from the data and metadata.
2 - I refactor get_trained_synthesizer so it always returns an SDGym synthesizer object.
3 - For SDV and third-party synthesizers, I defined the _internal_synthesizer attribute, which references the actual SDV or third-party model that the SDGym synthesizer wraps around.
4 - Define a MultiTableBaselineSynthesizer to use the scale parameter when sampling data.

@amontanez24, @pvk-developer, @fealho, I ran an AWS test today from this branch with Variant to ensure everything still works as expected with these changes. The result is here

@R-Palazzo R-Palazzo self-assigned this Nov 17, 2025
@sdv-team
Copy link
Contributor

@codecov
Copy link

codecov bot commented Nov 17, 2025

Codecov Report

❌ Patch coverage is 96.46018% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 76.39%. Comparing base (d4ae9e2) to head (48f4bf9).
⚠️ Report is 1 commits behind head on feature_branch/mutli_table_benchmark.

Files with missing lines Patch % Lines
sdgym/benchmark.py 80.00% 1 Missing ⚠️
sdgym/synthesizers/base.py 95.00% 1 Missing ⚠️
sdgym/synthesizers/generate.py 90.00% 1 Missing ⚠️
sdgym/synthesizers/sdv.py 87.50% 1 Missing ⚠️
Additional details and impacted files
@@                           Coverage Diff                            @@
##           feature_branch/mutli_table_benchmark     #497      +/-   ##
========================================================================
+ Coverage                                 76.04%   76.39%   +0.35%     
========================================================================
  Files                                        30       30              
  Lines                                      2371     2411      +40     
========================================================================
+ Hits                                       1803     1842      +39     
- Misses                                      568      569       +1     
Flag Coverage Δ
integration 62.30% <92.03%> (+0.69%) ⬆️
unit 71.46% <86.72%> (+0.43%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@R-Palazzo R-Palazzo force-pushed the issue-485-add-uniformsynthesizer branch from 84f1f71 to 6d659cf Compare November 18, 2025 09:21
@R-Palazzo R-Palazzo marked this pull request as ready for review November 18, 2025 12:58
@R-Palazzo R-Palazzo requested a review from a team as a code owner November 18, 2025 12:58
@R-Palazzo R-Palazzo removed the request for review from a team November 18, 2025 12:58
@R-Palazzo R-Palazzo changed the base branch from main to feature_branch/mutli_table_benchmark November 18, 2025 14:09
Copy link
Member

@fealho fealho left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, I only left 1 question.

Copy link
Contributor

@amontanez24 amontanez24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for the changes

model.fit(data)

return model
self._internal_synthesizer = model
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we could just call it self._model

Copy link
Member

@fealho fealho left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left a couple of minor docstring comments, besides that it looks good to me 👍

@R-Palazzo R-Palazzo merged commit f5abf35 into feature_branch/mutli_table_benchmark Nov 25, 2025
44 checks passed
@R-Palazzo R-Palazzo deleted the issue-485-add-uniformsynthesizer branch November 25, 2025 17:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add multi-table UniformSynthesizer

6 participants