-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] add log_dtypes to pandas_utils #99
Comments
After thinking about it, we probably want to have these three:
Note too sure about |
light-weight, param-driven implementation of parallelism using multiproc
Is this still open? I'd be happy to take it |
It is still open, but let's first make sure we're clear on what will get implemented. Do you want to implement Relevant: there's a related issue too #371. |
I was thinking about both. I don't fully understand what is happening in #371, why is matplotlib generating warnings and what would the print step do? |
The print step would do a very similar thing to what |
This got adressed. |
I might also want to log the
dtypes
between each step in a pandas pipeline. It's probably best to add a separate logger for both thedtype
as well as the shape.Should the column names be seperate too? Maybe worth a discussion.
It would also be nice if these features were documented on the documentation page together with some other
pandas_utils
functions.The text was updated successfully, but these errors were encountered: