Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: groupby with resample using on parameter errors when selecting column to apply function #17813

Closed
tdpetrou opened this Issue Oct 7, 2017 · 1 comment

Comments

Projects
None yet
2 participants
@tdpetrou
Copy link
Contributor

commented Oct 7, 2017

Code Sample, a copy-pastable example if possible

>>> df = pd.DataFrame({'id':list('aabbb'), 
                       'date':pd.date_range('1-1-2016', periods=5), 
                       'data':1})
>>> df.groupby('id').resample('2D', on='date').sum() #correct
               data
id date            
a  2016-01-01     2
b  2016-01-03     2
   2016-01-05     1

>>> df.set_index('date').groupby('id').resample('2D')['data'].sum() #correct
id  date      
a   2016-01-01    2
b   2016-01-03    2
    2016-01-05    1
Name: data, dtype: int64

>>> df.groupby('id').resample('2D', on='date')['data'].sum() #error
KeyError: 'The grouper name date is not found'

Problem description

Using a groupby followed by resample with the datetime not in the index forces the use of the on parameter. Unfortunately, this produces a key error whenever selecting a column to apply a function to.

Expected Output

Same as second one from above.

Output of pd.show_versions()

INSTALLED VERSIONS

commit: None
python: 3.6.1.final.0
python-bits: 64
OS: Darwin
OS-release: 15.6.0
machine: x86_64
processor: i386
byteorder: little
LC_ALL: None
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8

pandas: 0.20.3
pytest: 3.0.7
pip: 9.0.1
setuptools: 35.0.2
Cython: 0.25.2
numpy: 1.13.1
scipy: 0.19.0
xarray: None
IPython: 6.0.0
sphinx: 1.5.5
patsy: 0.4.1
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: 1.2.0
tables: 3.4.2
numexpr: 2.6.2
feather: None
matplotlib: 2.0.2
openpyxl: 2.4.7
xlrd: 1.0.0
xlwt: 1.2.0
xlsxwriter: 0.9.6
lxml: 3.7.3
bs4: 4.6.0
html5lib: 0.9999999
sqlalchemy: 1.1.9
pymysql: None
psycopg2: None
jinja2: 2.9.6
s3fs: None
pandas_gbq: None
pandas_datareader: 0.3.0.post

@tdpetrou tdpetrou changed the title groupby with resample with date not in index errors when selecting column to apply function groupby with resample using on parameter errors when selecting column to apply function Oct 7, 2017

@jreback jreback added this to the Next Major Release milestone Oct 8, 2017

@jreback

This comment has been minimized.

Copy link
Contributor

commented Oct 8, 2017

similar to #15072 & #16766 .

a pull request is welcome to fix.

@jreback jreback added the Bug label Oct 8, 2017

@jreback jreback changed the title groupby with resample using on parameter errors when selecting column to apply function BUG: groupby with resample using on parameter errors when selecting column to apply function Oct 8, 2017

@jreback jreback modified the milestones: Next Major Release, 0.23.0 Jan 30, 2018

@jreback jreback closed this in 5b58a20 Feb 5, 2018

harisbal pushed a commit to harisbal/pandas that referenced this issue Feb 28, 2018

BUG: groupby with resample using on parameter errors when selecting c…
…olumn to apply function

closes pandas-dev#17813

Author: discort <odiscort@gmail.com>

Closes pandas-dev#19433 from discort/fix_17813 and squashes the following commits:

2f25d40 [discort] Fixed bug in df.resample using 'on' parameter
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.