DataFrame.join can not have datetime as column names #811

turkeytest · 2012-02-22T19:06:53Z

Issue:
For DataFrameA.join( DataFrameB , on = 'somecol' ), DataFrameB can not have datetime objects as column labels

Example:

from datetime import datetime
from pandas import DataFrame

str_dates = [ '20120209' , '20120222' ]
dt_dates = [ datetime(2012,2,9) , datetime(2012,2,22)]

A = DataFrame(str_dates , index = range(2) , columns = ['aa'] )

B = DataFrame([[1,2],[3,4]] , index = str_dates , columns = str_dates)
C = DataFrame([[1,2],[3,4]] , index = str_dates , columns = dt_dates )

works = A.join( B , on = 'aa' ) # works -- extra column labels are string
fails = A.join( C , on = 'aa' ) # fails -- extra column labels are datetime

adamklein · 2012-02-22T19:30:54Z

The problem is not that the columns cannot be datetime, but rather that strings and datetimes cannot be compared (and thus the resulting columns cannot be ordered). So, if you have on the other hand

A = DataFrame(str_dates , index = range(2) , columns = [datetime(2012,1,1)] )

it should all work. eg,

In [16]: A = DataFrame(str_dates , index = range(2) , columns = [datetime(2012,1,1)] )

In [17]: A
Out[17]: 
  2012-01-01
0   20120209
1   20120222

In [18]: A.join(C, on=datetime(2012,1,1))
Out[18]: 
  2012-01-01  2012-02-09  2012-02-22
0   20120209           1           2
1   20120222           3           4

Not sure what can be done about this one.

wesm · 2012-02-24T21:09:18Z

This is a bug in Index.union. If two indexes are monotonic but their elements are incomparable, self._inner_indexer will fail. Write a unit test with a union of these two indexes:

ipdb> self
Index([aa], dtype=object)
ipdb> other
Index([2012-02-09 00:00:00, 2012-02-22 00:00:00], dtype=object)

ipdb> self
Index([aa], dtype=object)
ipdb> other
Index([2012-02-09 00:00:00, 2012-02-22 00:00:00], dtype=object)

then add a workaround in case of TypeError from the Cython method

adamklein · 2012-02-24T22:24:29Z

Wes, please have a look. Falls back on slower non-monotonic methodology in intersect on TypeError (union works fine - I think you meant intersection?).

wesm · 2012-02-24T22:40:06Z

Yeah intersection, sorry. Let me look at the PR

wesm closed this as completed in 75bf87b Feb 24, 2012

adamklein mentioned this issue Feb 24, 2012

BUG: close #811, fix index.intersection where indices are incomparable #830

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DataFrame.join can not have datetime as column names #811

DataFrame.join can not have datetime as column names #811

turkeytest commented Feb 22, 2012

adamklein commented Feb 22, 2012

wesm commented Feb 24, 2012

adamklein commented Feb 24, 2012

wesm commented Feb 24, 2012

DataFrame.join can not have datetime as column names #811

DataFrame.join can not have datetime as column names #811

Comments

turkeytest commented Feb 22, 2012

adamklein commented Feb 22, 2012

wesm commented Feb 24, 2012

adamklein commented Feb 24, 2012

wesm commented Feb 24, 2012