Skip to content

Conversation

@frances-h
Copy link
Contributor

This PR updates the data processor so that parameters used to instantiate the default transformer for an sdtype are used to create additional instances of the transformer.

@codecov
Copy link

codecov bot commented Mar 24, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.60%. Comparing base (b361f4b) to head (144e716).
Report is 1 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2431      +/-   ##
==========================================
+ Coverage   98.58%   98.60%   +0.01%     
==========================================
  Files          59       59              
  Lines        6161     6167       +6     
==========================================
+ Hits         6074     6081       +7     
+ Misses         87       86       -1     
Flag Coverage Δ
integration 82.29% <100.00%> (+0.03%) ⬆️
unit 97.47% <100.00%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@frances-h frances-h force-pushed the dpdatasynthesizer-compatability-updates branch from ce0c294 to 9c607a9 Compare March 25, 2025 13:36
@frances-h frances-h marked this pull request as ready for review March 25, 2025 13:36
@frances-h frances-h requested a review from a team as a code owner March 25, 2025 13:36
@frances-h frances-h requested review from gsheni and removed request for a team March 25, 2025 13:36
@frances-h frances-h requested a review from pvk-developer March 25, 2025 13:36
Comment on lines 498 to 505
@staticmethod
def _get_transformer_kwargs(transformer):
args = inspect.getfullargspec(transformer.__init__).args[1:]
return {
key: getattr(transformer, key)
for key in args
if key != 'model_missing_values' and hasattr(transformer, key)
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This method doesn't seem to belong here. We are trying to avoid creating static methods overall (following our standard). I feel like this method belongs more to RDT than the DataProcessor class. If we changed something, lets say, we deprecate one more argument, this change won't propagate here. However, if we were to add this to rdt we would require a new release etc.. and that is why I would like to suggest the following:

  • Move this function to the _utils.py in sdv (if possible).
  • File an issue in RDT regarding adding this functionality to the transformers.
  • File another issue in SDV to remove this utility function and use the transformer's method instead.

@frances-h frances-h force-pushed the dpdatasynthesizer-compatability-updates branch from 07215cf to ec3d3e3 Compare April 1, 2025 14:00
@frances-h frances-h force-pushed the dpdatasynthesizer-compatability-updates branch from ec3d3e3 to 144e716 Compare April 8, 2025 15:07
@frances-h frances-h merged commit 1114b57 into main Apr 8, 2025
45 checks passed
@frances-h frances-h deleted the dpdatasynthesizer-compatability-updates branch April 8, 2025 16:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants