Skip to content

MultiIndex groupby bugs in 0.7.3, 0.8.0b1 and 0.8.0dev #1401

@ruidc

Description

@ruidc
import pandas
l = [['count', 'values'], ['to filter', '']]
midx = pandas.MultiIndex.from_tuples(l)
df = pandas.DataFrame([[1L, 'A']], columns=midx) #one line
print(df.groupby('to filter').groups)
#Out: {'to filter': [0L]} #was expecting 'A': [0L]
print(df.groupby([('to filter', '')]).groups)
#Out: {'to filter': [0L]} #was expecting same as above
df = pandas.DataFrame([[1L, 'A'], [2L, 'B']], columns=midx) #two lines, different group
print(df.groupby('to filter').groups)
#Out: {'A': [0L], 'B': [1L]} #fine
print(df.groupby([('to filter', '')]).groups)
#Out: {'': [1L], 'to filter': [0L]} #was expecting same as above
df = pandas.DataFrame([[1L, 'A'], [2L, 'A']], columns=midx) #two lines, same group
print(df.groupby('to filter').groups)
#Out: {'A': [0L, 1L]} #fine
print(df.groupby([('to filter', '')]).groups)
#Out: {'': [1L], 'to filter': [0L]} #was expecting same as above

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions