TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 #2101

martinfleis · 2021-09-05T13:38:08Z

overlay based on symmetric_difference started failing on pandas master because we used an outer join based on columns full of NaNs. Refactoring the implementation to avoid it.

martinfleis · 2021-09-13T10:45:29Z

New pandas is out breaking even more environments.

@jorisvandenbossche can we get this in to make CI useful again?

jorisvandenbossche · 2021-09-13T14:40:24Z

Hmm, that's a pity I didn't look into it earlier, as this is a regression in pandas as far as I can see, and unfortunately it ended up in the 1.3.x release (and not just in master). See pandas-dev/pandas#43550

Now, I think this patch is a nice clean-up anyway :) (not fully sure why we were using an outer merge on keys we knew had nothing in common, so it's basically a concat on the other axis)

jorisvandenbossche · 2021-09-13T14:41:18Z

Now, I think this patch is a nice clean-up anyway :) (not fully sure why we were using an outer merge on keys we knew had nothing in common, so it's basically a concat on the other axis)

Actually, that doesn't handle duplicate column names .. (column names present in both left and right dataframe: with concat they are combined in a single column, with merge they are kept as separate columns, so that's probably the reason we were using merge and not concat ..)

martinfleis · 2021-09-13T19:56:47Z

Actually, that doesn't handle duplicate column names

True. I didn't check that and our CI is not doing that either.

I guess I will scrap this PR and skip the test for pandas 1.3.3 assuming it will get fixed there?

jorisvandenbossche · 2021-09-15T09:40:28Z

Yes, skipping the test for pandas 1.3.3 might be best for now (it should already be fixed on master). Hopefully pandas can do a 1.3.4 relatively soon.

jorisvandenbossche · 2021-09-15T09:41:23Z

And we should also add a test with a case that would be different between merge vs concat

martinfleis · 2021-09-15T22:07:27Z

I've reverted the code change back, added xfail marks where needed and changed the existing test of duplicated columns to run with all how options. That would catch the bug I had here before.

jorisvandenbossche · 2021-09-16T07:05:33Z

I pushed two changes:

xfailing the how fixture, which avoids having to add the if pandas_133 ... check to each test that uses it
I pinned to pandas 1.3.2 in one of the "latest" envs, so we have at least one build with recent pandas where we still test overlay (and that also avoids having to temporarily add the doctest skip)

jorisvandenbossche · 2021-09-16T08:55:28Z

OK, all passing so merging we can get our CI back green :) Thanks!

BUG: outer merge in overlay symmetric_difference

77d9fe7

martinfleis added this to the 0.10 milestone Sep 5, 2021

martinfleis added 2 commits September 5, 2021 14:44

fix column name

060bb95

potential old pandas fix

be5fc20

This was referenced Sep 9, 2021

Adding Geography support to to_postgis function #2096

Closed

ENH: Add storage_options to read_parquet #2107

Merged

jorisvandenbossche mentioned this pull request Sep 13, 2021

REGR: outer join with integer key and NaNs failing pandas-dev/pandas#43550

Closed

martinfleis added 4 commits September 15, 2021 22:55

revert code change

605e0b1

xfail tests for pandas 1.3.3

f19c396

fix test_crs_mismatch

f3361d3

properly tesst duplicated columns

8cc5abe

martinfleis changed the title ~~BUG: outer merge in overlay symmetric_difference~~ TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 Sep 15, 2021

martinfleis and others added 4 commits September 15, 2021 23:34

xfail clip

8ccf700

skip for difference

1dc302e

skip doctests

7863b7d

xfail fixture instead of each test + pin to pd 1.3.2 in one env

969d45a

jorisvandenbossche merged commit 8caa139 into geopandas:master Sep 16, 2021

martinfleis deleted the overlay_symm branch September 16, 2021 08:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 #2101

TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 #2101

martinfleis commented Sep 5, 2021

martinfleis commented Sep 13, 2021 •

edited

jorisvandenbossche commented Sep 13, 2021

jorisvandenbossche commented Sep 13, 2021 •

edited

martinfleis commented Sep 13, 2021

jorisvandenbossche commented Sep 15, 2021

jorisvandenbossche commented Sep 15, 2021

martinfleis commented Sep 15, 2021

jorisvandenbossche commented Sep 16, 2021

jorisvandenbossche commented Sep 16, 2021

TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 #2101

TST: skip tests of in overlay symmetric_difference under pandas 1.3.3 #2101

Conversation

martinfleis commented Sep 5, 2021

martinfleis commented Sep 13, 2021 • edited

jorisvandenbossche commented Sep 13, 2021

jorisvandenbossche commented Sep 13, 2021 • edited

martinfleis commented Sep 13, 2021

jorisvandenbossche commented Sep 15, 2021

jorisvandenbossche commented Sep 15, 2021

martinfleis commented Sep 15, 2021

jorisvandenbossche commented Sep 16, 2021

jorisvandenbossche commented Sep 16, 2021

martinfleis commented Sep 13, 2021 •

edited

jorisvandenbossche commented Sep 13, 2021 •

edited