Speed up calculation of the QualityReport #723
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #723   +/-   ##
=======================================
  Coverage   95.09%   95.09%
=======================================
  Files         111      111
  Lines        4299     4303    +4
=======================================
+ Hits         4088     4092    +4
  Misses        211      211

Flags with carried forward coverage won't be shown.
sdmetrics/reports/single_table/_properties/column_pair_trends.py
e69e080 to 8990f35
If it doesn't slow down the integration tests too much, could we add an integration test with a large dataset for the multi-table case as well?
metric_parameters = {}
if (metric == ContingencySimilarity) and (
    max(len(data_real), len(data_synthetic)) > DEFAULT_NUM_ROWS_SUBSAMPLE
):
    metric_parameters['num_rows_subsample'] = DEFAULT_NUM_ROWS_SUBSAMPLE

return data_real, data_synthetic, metric, metric_parameters
Why not just do this in the calling method (_generate_details) instead of adding another return value to this method?
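One way to read the suggestion above is to keep the subsampling decision in a small helper that the caller invokes, so the preprocessing method keeps its original return signature. The following is only a sketch under assumptions: the helper name `get_metric_parameters` is hypothetical, the two classes are empty stand-ins for the real metric classes, and the value of `DEFAULT_NUM_ROWS_SUBSAMPLE` is a placeholder, not taken from the PR.

```python
# Placeholder value for illustration only; the PR's actual constant is not shown here.
DEFAULT_NUM_ROWS_SUBSAMPLE = 50_000


class ContingencySimilarity:
    """Empty stand-in for the real metric class (assumption)."""


class CorrelationSimilarity:
    """Empty stand-in for a metric that is never subsampled (assumption)."""


def get_metric_parameters(metric, data_real, data_synthetic):
    """Hypothetical helper: build the extra metric kwargs in the caller.

    Subsampling is requested only for ContingencySimilarity, and only when
    either table has more rows than DEFAULT_NUM_ROWS_SUBSAMPLE.
    """
    metric_parameters = {}
    largest = max(len(data_real), len(data_synthetic))
    if metric is ContingencySimilarity and largest > DEFAULT_NUM_ROWS_SUBSAMPLE:
        metric_parameters['num_rows_subsample'] = DEFAULT_NUM_ROWS_SUBSAMPLE
    return metric_parameters
```

With a helper like this, the calling method could build `metric_parameters` itself and pass them on as keyword arguments, while the preprocessing method continues to return only `data_real`, `data_synthetic`, and `metric`.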
CU-86b3w6r0k
Resolve #718