'mode' not recognized by df.groupby().agg(), but pd.Series.mode works #11562

patricksurry · 2015-11-09T16:15:52Z

This works:

df = pd.DataFrame({'A': [1, 2, 1, 2, 1, 2, 3], 'B': [1, 1, 1, 2, 2, 2, 2]})
df.groupby('B').agg(pd.Series.mode)

but this doesn't:

df.groupby('B').agg('mode')

...
AttributeError: Cannot access callable attribute 'mode' of 'DataFrameGroupBy' objects, try using the 'apply' method

I thought all the series aggregate methods propagated automatically to groupby, but I've probably misunderstood?

patricksurry · 2015-11-09T16:41:32Z

Hmm, I guess this might be because pd.Series.mode() returns a series, not a scalar. So maybe I need my own mode that decides how to handle the multi-modal case, e.g. pd.Series.mode().mean() or whatever?
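A minimal sketch of that idea, assuming you are happy to resolve ties by averaging the modes. The helper name mode_mean is hypothetical and not part of pandas; any other tie-breaking rule could be substituted.

```python
import pandas as pd


def mode_mean(s: pd.Series) -> float:
    """Reduce a group to one scalar: the mean of all its modes (hypothetical helper)."""
    return s.mode().mean()


df = pd.DataFrame({'A': [1, 2, 1, 2, 1, 2, 3], 'B': [1, 1, 1, 2, 2, 2, 2]})

# Because mode_mean returns a scalar, it can be passed to .agg() directly.
result = df.groupby('B')['A'].agg(mode_mean)
```

Since the function always returns a single value per group, the result has one row per value of B regardless of how many modes each group has.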

TomAugspurger · 2015-11-09T16:46:26Z

> might be because pd.Series.mode() returns a series, not a scalar

Correct. IIRC there's an older issue about this, where we decided to keep our behavior of always returning a series, and not adding a flag to reduce if possible. I could be misremembering though.

In these cases I'll usually just use scipy's mode:

df.groupby('B').agg(lambda x: scipy.stats.mode(x)[0])

scipy.stats.mode returns a tuple of (mode, count) and we just want the mode.

jreback · 2016-07-26T22:25:11Z

Here's a mini-example; could be like .value_counts()

In [6]: df = DataFrame({'A' : [1,2,1,2], 'B' : [1,1,1,1]})

In [7]: df
Out[7]: 
   A  B
0  1  1
1  2  1
2  1  1
3  2  1

In [8]: df.groupby('A').B.value_counts()
Out[8]: 
A  B
1  1    2
2  1    2
Name: B, dtype: int64

In [9]: df.groupby('A').B.apply(lambda x: x.mode())
Out[9]: 
A   
1  0    1
2  0    1
Name: B, dtype: int64
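To make the shape of that result easier to see, here is a small sketch with made-up data in which one group has two modes. The data and the variable names (modes, modes_by_group) are assumptions for illustration only.

```python
import pandas as pd

# Group A=1 has two modes in B (5 and 6); group A=2 has one (7).
df = pd.DataFrame({'A': [1, 1, 1, 1, 2, 2], 'B': [5, 5, 6, 6, 7, 7]})

# apply() keeps every mode: the result carries a two-level index,
# the group key 'A' plus the position of each mode within its group.
modes = df.groupby('A')['B'].apply(lambda x: x.mode())

# Collect the modes per group into plain lists for inspection.
modes_by_group = {key: grp.tolist() for key, grp in modes.groupby(level=0)}
```

This keeps all modes, much like .value_counts() keeps all counts, at the cost of an extra index level that the caller has to handle.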

kernc · 2017-04-07T11:35:10Z

What about when grouping Series?

I have no issue with .agg('mode') returning the first mode, if any, while issuing a warning if the modes were multiple.
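The behaviour suggested here can be approximated in user code today. The following is only a sketch: first_mode is a hypothetical helper, not a pandas function, and returning NaN for a group with no mode is an assumption about what would be sensible.

```python
import warnings

import pandas as pd


def first_mode(s: pd.Series):
    """Return the first (smallest) mode of s, warning if there are several."""
    modes = s.mode()  # Series.mode() returns the modes in sorted order
    if modes.empty:
        # Assumption: a group with no mode (e.g. all NaN) maps to NaN.
        return float('nan')
    if len(modes) > 1:
        warnings.warn(f"{len(modes)} modes found; returning the first")
    return modes.iloc[0]


df = pd.DataFrame({'A': [1, 2, 1, 2, 1, 2, 3], 'B': [1, 1, 1, 2, 2, 2, 2]})
result = df.groupby('B')['A'].agg(first_mode)
```

Using .iloc[0] rather than [0] selects by position, so it does not depend on the index labels of the Series returned by mode().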

gosuto-inzasheru · 2019-10-09T19:20:29Z

I encountered this problem and ended up settling for this:
.agg({'column': lambda x: pd.Series.mode(x)[0]})

But yes, I agree with @kernc, I would not mind .agg('mode') returning the first mode if multiple modes are returned.

mroeschke · 2021-04-21T04:53:07Z

xref #19254

rhshadrach · 2023-04-23T17:06:56Z

Closing as a duplicate of #19254

sinhrks mentioned this issue Jul 26, 2016

Why is there no mode method for groupby objects? #13809

Closed

jreback added Enhancement Groupby API Design labels Jul 26, 2016

jreback added this to the Next Major Release milestone Jul 26, 2016

jreback added Difficulty Intermediate labels Apr 7, 2017

jbrockmendel removed Difficulty Intermediate labels Oct 21, 2019

mroeschke removed the API Design label Apr 21, 2021

mroeschke removed this from the Contributions Welcome milestone Oct 13, 2022

jbrockmendel added the Apply (Apply, Aggregate, Transform) label Feb 11, 2023

rhshadrach closed this as completed Apr 23, 2023

rhshadrach added the Duplicate Report (Duplicate issue or pull request) label Apr 23, 2023
