[QUESTION] Plans for an equivalent to pandas groupby? #341

bscully27 · 2020-04-01T16:19:19Z

I just started using this library, love it.

Quick question - are there any plans for an equivalent to pandas groupby?

Something like:
bn.group_by(matrix[:, :2]).reduce(matrix[:, -1], np.sum)
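
For readers unsure what that call would compute, here is a rough plain-NumPy sketch of the same operation: group rows by the first two columns and sum the last column within each group. The `bn.group_by` API in the request does not exist in bottleneck, and `group_sum` below is a hypothetical helper name used only for illustration. It handles the `np.sum` case only.

```python
import numpy as np

def group_sum(keys, values):
    """Sum `values` within each distinct row of `keys` (hypothetical helper)."""
    keys = np.asarray(keys)
    values = np.asarray(values, dtype=float)
    # Each distinct key row gets an integer id; `inverse` maps every input row to its id.
    unique_keys, inverse = np.unique(keys, axis=0, return_inverse=True)
    # np.bincount with weights adds up the values that share an id.
    # ravel() keeps this working across NumPy versions that shape `inverse` differently.
    sums = np.bincount(inverse.ravel(), weights=values, minlength=len(unique_keys))
    return unique_keys, sums

matrix = np.array([
    [1, 1, 10.0],
    [1, 2, 20.0],
    [1, 1, 30.0],
    [2, 1, 40.0],
])
group_keys, group_totals = group_sum(matrix[:, :2], matrix[:, -1])
# group_keys   -> rows [1, 1], [1, 2], [2, 1]
# group_totals -> [40.0, 20.0, 40.0]
```

`np.unique` sorts the key rows, so this costs a sort on every call. Avoiding that overhead is presumably where a dedicated C implementation would gain.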

qwhelan · 2020-04-02T04:26:57Z

To be honest, I hadn't considered it. Are you looking to avoid a pandas dependency, or do you see this as a way to get more performance?

bscully27 · 2020-04-02T12:58:48Z

The latter, to get more performance. I believe pandas groupby has been optimized (not sure if via Cython), but a dedicated bottleneck C function would provide substantial speed gains.

qwhelan · 2020-04-02T15:58:46Z

Okay, thanks for clarifying. I'll keep this open in case someone would like to try out PRs in this vein, but probably won't take a more serious look at this myself until I clear out the backlog.

max-sixty · 2023-12-20T03:56:31Z

FYI for anyone looking for these — numbagg has groupby functions. It makes a good complement to bottleneck...
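
As a rough illustration of what a grouped reduction over integer labels looks like, here is a plain-NumPy sketch of a NaN-skipping grouped sum. This is not numbagg's API: the function name `group_nansum_by_label` and its parameters are hypothetical, chosen only for this example, and it assumes the labels are non-negative integers.

```python
import numpy as np

def group_nansum_by_label(values, labels, num_labels=None):
    """Sum `values` per integer label, ignoring NaNs (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    # Assumption: labels are non-negative integers, one per value.
    labels = np.asarray(labels, dtype=np.intp)
    if num_labels is None:
        num_labels = int(labels.max()) + 1 if labels.size else 0
    # Drop NaN entries so they do not contribute to any group's total.
    keep = ~np.isnan(values)
    return np.bincount(labels[keep], weights=values[keep], minlength=num_labels)

values = np.array([1.0, np.nan, 3.0, 4.0, 5.0])
labels = np.array([0, 0, 1, 1, 2])
totals = group_nansum_by_label(values, labels)
# totals -> [1.0, 7.0, 5.0]
```

Because the labels are already integers, no sort or key lookup is needed, which is why this form suits a compiled implementation.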

bscully27 added the bug label Apr 1, 2020

bscully27 assigned qwhelan Apr 1, 2020

qwhelan added enhancement and removed bug labels Apr 2, 2020
