-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
value_counts is slow for nested columns #115
Comments
Introduced in #100 |
The slowest are
|
If there's not a cache, seems like you're calculating |
You could use a generator to reduce the number of calls to the function. |
…his should help with scrapinghub#115
@victor-torres It doesn't make much difference since |
No major comments here. |
…his should help with scrapinghub#115
I thought we can put
But we need to find a solution since right now Without nested data it takes around 30 seconds. |
More data to follow
Because
value_counts
is slow, any big df makes report_all awfully slow.or make a parameter like (fast=True)
The text was updated successfully, but these errors were encountered: