BUG: DataFrame with scalar tzaware Timestamp #44518

jbrockmendel · 2021-11-18T21:11:02Z

closes BUG: Regression - AmbiguousTimeError creating DataFrame #42505
tests added / passed
Ensure all linting tests pass, see here for how to run them
whatsnew entry

@simonjayhawkins where does the whatsnew for this go?
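
For context, a minimal sketch of the kind of construction the title refers to: a scalar tz-aware Timestamp broadcast against a list-like column. The specific timestamp (a wall time that occurs twice on a DST fall-back day in America/Los_Angeles) and the column names are assumptions chosen for illustration, not taken from this PR's tests. On a pandas version that includes this fix, the construction is expected to succeed.

```python
import pandas as pd

# Assumed example value: the 01:00 wall time occurs twice on 2019-11-03 in
# America/Los_Angeles; the explicit -0700 offset pins the first occurrence.
ts = pd.Timestamp("2019-11-03 01:00:00-0700", tz="America/Los_Angeles")

# The scalar is broadcast to the length of the list-like column.
df = pd.DataFrame({"dt": ts, "value": [1, 2]})

print(df["dt"].dtype)
print(df["dt"].iloc[0] == ts)
```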

simonjayhawkins · 2021-11-19T10:33:22Z

> @simonjayhawkins where does the whatsnew for this go?

wdyt?

sequence_to_dt64ns is called from sequence_to_datetimes, which is called from maybe_infer_to_datetimelike and maybe_cast_to_datetime in pandas/core/dtypes/cast.py, so this change is not narrowly targeted at the reported issue or at reverting to the previous behavior.

I have no doubt that this is the correct long-term fix (option 2 in #42505 (comment)), but I'm not sure whether other interactions would expose other latent bugs.

This is fine for 1.4 IMO, but would a more narrowly targeted fix for 1.3.5 be safer (option 3 in #42505 (comment))?

We must have little testing of scalars passed to DatetimeArray._from_sequence. Is there a risk that fixing this properly here changes any existing behavior?

jbrockmendel · 2021-11-19T17:34:53Z

> wdyt?

I guess for now put it in the 1.4 file and discuss whether to backport?

> We must have little testing of scalars passed to DatetimeArray._from_sequence. Is there a risk that fixing this properly here changes any existing behavior?

Only incorrect behavior AFAICT.

> but would a more narrowly targeted fix for 1.3.5 be safer, option 3 in #42505 (comment)

I'm not clear on how we can get more targeted than this.

jreback · 2021-11-20T16:06:55Z

this is fine, can you add a whatsnew for 1.3.5

jreback · 2021-11-20T21:12:17Z

@meeseeksdev backport 1.3.x

jreback · 2021-11-20T21:13:00Z

@jbrockmendel if you can push the backport :-<>

simonjayhawkins · 2021-11-21T10:10:33Z

> but would a more narrowly targeted fix for 1.3.5 be safer, option 3 in #42505 (comment)

> I'm not clear on how we can get more targeted than this.

I was thinking of a special case in construct_1d_arraylike_from_scalar. The value returned by infer_dtype_from_scalar is now a Timestamp, whereas it was an int in pandas 1.2.5.

So we then have the block... if isinstance(dtype, ExtensionDtype), where we pass the value on to DatetimeArray._from_sequence. My thinking was to pass value.value if it is an instance of Timestamp, to restore the 1.2.5 behavior explicitly without touching DatetimeArray._from_sequence.

so I was thinking something like...

diff --git a/pandas/core/dtypes/cast.py b/pandas/core/dtypes/cast.py
index 6d5162f3fe..a02e8e085c 100644
--- a/pandas/core/dtypes/cast.py
+++ b/pandas/core/dtypes/cast.py
@@ -1900,6 +1900,8 @@ def construct_1d_arraylike_from_scalar(
 
     """
 
+    from pandas.core.arrays import DatetimeArray
+
     if dtype is None:
         try:
             dtype, value = infer_dtype_from_scalar(value, pandas_dtype=True)
@@ -1908,6 +1910,8 @@ def construct_1d_arraylike_from_scalar(
 
     if isinstance(dtype, ExtensionDtype):
         cls = dtype.construct_array_type()
+        if issubclass(cls, DatetimeArray) and isinstance(value, Timestamp):
+            value = value.value
         subarr = cls._from_sequence([value] * length, dtype=dtype)
 
     else:

with an appropriate code comment about the hack.

The non-extension array block seems to include some special handling for dtype.kind in ["M", "m"] to unbox the values, so unboxing the value from a Timestamp at this location is maybe not so strange after all?
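
To illustrate why unboxing would sidestep the ambiguity, here is a small sketch using only public API (Timestamp.value and the Timestamp constructor). It does not call DatetimeArray._from_sequence, and the example timestamp is an assumption. Timestamp.value is the integer nanosecond count since the UTC epoch, so two instants that share a wall time still have distinct integers.

```python
import pandas as pd

# Assumed example: first occurrence of 01:00 on a DST fall-back day.
ts = pd.Timestamp("2019-11-03 01:00:00-0700", tz="America/Los_Angeles")

# One hour later in absolute time the clock reads 01:00 again (now -0800).
later = ts + pd.Timedelta(hours=1)

# Same wall time once the timezone is dropped...
same_wall_time = ts.tz_localize(None) == later.tz_localize(None)

# ...but the unboxed integers differ by exactly one hour of nanoseconds.
diff_ns = later.value - ts.value

# An integer is interpreted as UTC epoch nanoseconds, so rebuilding from it
# needs no disambiguation of the wall time.
rebuilt = pd.Timestamp(ts.value, tz=ts.tz)

print(same_wall_time, diff_ns, rebuilt == ts)
```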

> We must have little testing of scalars passed to DatetimeArray._from_sequence. Is there a risk that fixing this properly here changes any existing behavior?

> Only incorrect behavior AFAICT.

yes. any change in behavior from doing the proper fix for 1.3.5 can probably be put under the "bug fix" umbrella, so maybe this is fine.

…e release note

BUG: DataFrame with scalar tzaware Timestamp

357f5db

simonjayhawkins added the Constructors, Regression, and Timezones labels Nov 19, 2021

Merge branch 'master' into bug-42505

7590d60

jreback added this to the 1.3.5 milestone Nov 20, 2021

jbrockmendel added 2 commits November 20, 2021 11:07

Merge branch 'master' into bug-42505

c4ae639

whatsnew

8ddfa93

jreback merged commit a327ad1 into pandas-dev:master Nov 20, 2021

lumberbot-app bot added the Still Needs Manual Backport label Nov 20, 2021

jbrockmendel added a commit to jbrockmendel/pandas that referenced this pull request Nov 20, 2021

BUG: DataFrame with scalar tzaware Timestamp (pandas-dev#44518)

566bb67

jbrockmendel added a commit to jbrockmendel/pandas that referenced this pull request Nov 20, 2021

BUG: DataFrame with scalar tzaware Timestamp (pandas-dev#44518)

6b52a9f

jbrockmendel deleted the bug-42505 branch November 20, 2021 23:15

simonjayhawkins mentioned this pull request Nov 21, 2021

Backport PR #44518 on branch 1.3.x (BUG: DataFrame with scalar tzaware Timestamp) #44546

Merged

simonjayhawkins removed the Still Needs Manual Backport label Nov 21, 2021

simonjayhawkins added a commit to simonjayhawkins/pandas that referenced this pull request Nov 21, 2021

DOC: follow-up to pandas-dev#44518, move release note

cd50376

simonjayhawkins mentioned this pull request Nov 21, 2021

DOC: follow-up to #44518, move release note #44557

Merged

simonjayhawkins pushed a commit that referenced this pull request Nov 21, 2021

BUG: DataFrame with scalar tzaware Timestamp (#44518) (#44546)

4ffccaf

simonjayhawkins added a commit that referenced this pull request Nov 21, 2021

DOC: follow-up to #44518, move release note (#44557)

e1fbd3c

simonjayhawkins added a commit to simonjayhawkins/pandas that referenced this pull request Nov 21, 2021

Backport PR pandas-dev#44557: DOC: follow-up to pandas-dev#44518, mov…

12f1efa

…e release note

simonjayhawkins added a commit that referenced this pull request Nov 21, 2021

Backport PR #44557: DOC: follow-up to #44518, move release note (#44558)

6462e54
