BUG: regression in DataFrame.combine_first with integer columns (GH14687) #14886

jorisvandenbossche · 2016-12-15T10:53:24Z

…687)

jreback · 2016-12-15T11:30:15Z

pandas/core/frame.py

-                otherSeries = otherSeries.astype(new_dtype)
-            else:
+            new_dtype = this_dtype
+            if this_dtype != other_dtype:


use not is_dtype_equal

jreback · 2016-12-15T11:31:25Z

minor change, lgtm otherwise.

jorisvandenbossche · 2016-12-15T11:33:24Z

@sinhrks see #14687. The code I now removed was introduced in #13970 (see https://github.com/pandas-dev/pandas/pull/13970/files#diff-1e79abbbdd150d4771b91ea60a4e1cc7L3703).
Can you still remember the reason for the notnull(series).all()?

codecov-io · 2016-12-15T16:46:47Z

Current coverage is 85.30% (diff: 100%)

Merging #14886 into master will increase coverage by <.01%

@@             master     #14886   diff @@
==========================================
  Files           144        144          
  Lines         51004      51005     +1   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits          43511      43512     +1   
  Misses         7493       7493          
  Partials          0          0

Powered by Codecov. Last update 5f889a2...7f325cd

sinhrks · 2016-12-16T03:59:04Z

@jorisvandenbossche Thx for the fix.

I think it is to keep caller's dtype if it doesn't have missing. If #14687 example can keep int dtype, the logic should be unnecessary.

jorisvandenbossche · 2016-12-16T09:43:18Z

I think it is to keep caller's dtype if it doesn't have missing.

OK, because there is a _possibly_downcast_to_dtype(arr, this_dtype) later on, it will be cast to int again if possible, and in this way also preserves the int, AFAIU

…687) (pandas-dev#14886)

…t with integer columns (GH14687) (pandas-dev#14886) (cherry picked from commit 992dfbc)

Version 0.19.2 * tag 'v0.19.2': (78 commits) RLS: v0.19.2 DOC: update release notes for 0.19.2 TST: skip gbq upload test as flakey DOC: clean-up v0.19.2 whatsnew DOC: update Pandas Cheat Sheet (GH13202) DOC: Pandas Cheat Sheet TST: matplotlib 2.0 fix in log limits for barplot (GH14808) (pandas-dev#14957) flake8 fix import Remove test - from 0.20.0 PR slipped in PERF: fix getitem unique_check / initialization issue cache and remove boxing (pandas-dev#14931) CLN: Resubmit of GH14700. Fixes GH14554. Errors other than Indexing… Clean up construction of Series with dictionary and datetime index BUG: .fillna() for datetime64 with tz is passing thru floats BUG: Patch read_csv NA values behaviour ENH: merge_asof() has type specializations and can take multiple 'by' parameters (pandas-dev#13936) [Backport pandas-dev#14886] BUG: regression in DataFrame.combine_first with integer columns (GH14687) (pandas-dev#14886) Fixed KDE Plot to drop the missing values (pandas-dev#14820) ENH: merge_asof() has left_index/right_index and left_by/right_by (pandas-dev#14253) (pandas-dev#14531) TST: correct url for test file on s3 (xref pandas-dev#14587) ...

* releases: (78 commits) RLS: v0.19.2 DOC: update release notes for 0.19.2 TST: skip gbq upload test as flakey DOC: clean-up v0.19.2 whatsnew DOC: update Pandas Cheat Sheet (GH13202) DOC: Pandas Cheat Sheet TST: matplotlib 2.0 fix in log limits for barplot (GH14808) (pandas-dev#14957) flake8 fix import Remove test - from 0.20.0 PR slipped in PERF: fix getitem unique_check / initialization issue cache and remove boxing (pandas-dev#14931) CLN: Resubmit of GH14700. Fixes GH14554. Errors other than Indexing… Clean up construction of Series with dictionary and datetime index BUG: .fillna() for datetime64 with tz is passing thru floats BUG: Patch read_csv NA values behaviour ENH: merge_asof() has type specializations and can take multiple 'by' parameters (pandas-dev#13936) [Backport pandas-dev#14886] BUG: regression in DataFrame.combine_first with integer columns (GH14687) (pandas-dev#14886) Fixed KDE Plot to drop the missing values (pandas-dev#14820) ENH: merge_asof() has left_index/right_index and left_by/right_by (pandas-dev#14253) (pandas-dev#14531) TST: correct url for test file on s3 (xref pandas-dev#14587) ...

BUG: regression in DataFrame.combine_first with integer columns (GH14…

c717bfe

…687)

jorisvandenbossche force-pushed the bug-combine-first branch from 9e8968a to c717bfe Compare December 15, 2016 10:54

jorisvandenbossche added Bug Regression Functionality that used to work in a prior pandas version labels Dec 15, 2016

jorisvandenbossche added this to the 0.19.2 milestone Dec 15, 2016

jreback reviewed Dec 15, 2016

View reviewed changes

jreback approved these changes Dec 15, 2016

View reviewed changes

use is_dtype_equal

7f325cd

jorisvandenbossche merged commit 992dfbc into pandas-dev:master Dec 16, 2016

ischurov pushed a commit to ischurov/pandas that referenced this pull request Dec 19, 2016

BUG: regression in DataFrame.combine_first with integer columns (GH14…

6f74275

…687) (pandas-dev#14886)

jorisvandenbossche added a commit to jorisvandenbossche/pandas that referenced this pull request Dec 24, 2016

[Backport pandas-dev#14886] BUG: regression in DataFrame.combine_firs…

a509172

…t with integer columns (GH14687) (pandas-dev#14886) (cherry picked from commit 992dfbc)

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: regression in DataFrame.combine_first with integer columns (GH14687) #14886

BUG: regression in DataFrame.combine_first with integer columns (GH14687) #14886

jorisvandenbossche commented Dec 15, 2016

jreback Dec 15, 2016

jreback commented Dec 15, 2016

jorisvandenbossche commented Dec 15, 2016

codecov-io commented Dec 15, 2016

sinhrks commented Dec 16, 2016

jorisvandenbossche commented Dec 16, 2016

BUG: regression in DataFrame.combine_first with integer columns (GH14687) #14886

BUG: regression in DataFrame.combine_first with integer columns (GH14687) #14886

Conversation

jorisvandenbossche commented Dec 15, 2016

jreback Dec 15, 2016

Choose a reason for hiding this comment

jreback commented Dec 15, 2016

jorisvandenbossche commented Dec 15, 2016

codecov-io commented Dec 15, 2016

Current coverage is 85.30% (diff: 100%)

sinhrks commented Dec 16, 2016

jorisvandenbossche commented Dec 16, 2016