New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
optimize count
for groupby
#132
Comments
Another interesting optimization would be that the max needed value for count is the |
Yes, good idea. IIUC this would help where there's a multi-dimensional output. Otherwise it's a single allocation anyway... |
For transparency — I'm pushing these off for the moment. We're already 2x faster than pandas, and so we can wait until we have more adoption before adding optimizations. (Though if someone in the future reads this — contributions welcome!) |
Surfacing Stephan's comment from #13 (comment)
which I think is referring to
numbagg/numbagg/grouped.py
Line 20 in 592da9c
The text was updated successfully, but these errors were encountered: