New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rewrite create_synthetic_data
#514
Conversation
Thanks, sounds great! +1 for removing that feature |
…t actually does not raise anymore already ;-)).
Any chance to get this merged within a couple of days? |
I can have a look at it on Monday. |
Codecov ReportPatch coverage:
Additional details and impacted files@@ Coverage Diff @@
## main #514 +/- ##
==========================================
- Coverage 94.20% 94.07% -0.13%
==========================================
Files 48 48
Lines 2983 2903 -80
==========================================
- Hits 2810 2731 -79
+ Misses 173 172 -1
☔ View full report in Codecov by Sentry. |
|
Somehow, the last commit @hmgaudecker, please check again before your review. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very nice, much clearer than before! Just a smaller things.
Thanks for the feedback! I addressed all the comments now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Excellent, thank you!
Only realised ex post that docs are failing because some notebook seems to be using the old interface of |
Thanks! See my comment above:
The documentation is currently incomplete. Easiest to fix is probably to revert to the state before the change of pre-commit hooks. So we shouldn't make any changes to the current version on main. So I think the options are:
|
Hope that last commit will do, this brought all print-statements back in and changed the hooks. |
Thanks for fixing the tutorials! I updated them now using the new interface of create_synthetic |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Excellent, thanks! Just a couple of typos / language details, will implement myself.
What problem do you want to solve?
create_synthetic_data
is currently overly complicated and slow (especially when many households are created at the same time). Main problem seems to be that a lot of pandas objects are created and concatenated.To fix the problems, I rewrote most of the code. Remarks:
dict of lists
:specs_constant_over_households
dict of lists of lists
:specs_heterogeneous
Todo
Collection of ideas
mietstufe: 2
instead ofmietstufe: [2, 2, 2, 2]