Fix polygon merging failure. #1848
Conversation
Sometimes the GEOS/Shapely polygon merging routine `unary_union` fails with a `ValueError` and the message "No Shapely geometry can be created from null value." This means that the union failed and GEOS is passing a null value back to Shapely. This was repeatably happening with tile `11/1052/672`, near Monnickendam in the Netherlands, when trying to merge together a lot of very small and closely-spaced field polygons. The problem appears to be one of numerical instability: because the polygons are so closely spaced, some intersection tests come up with nonsensical answers.

I did initially try to fix this by isolating the problem, recursively attempting to merge subsets of the list of polygons until either some small set was unmergeable or the whole thing worked. Unfortunately, this turned out not to be robust, because the issue is in the merging *between* two polygons, not in any polygon itself, so it simply resurfaced when aggregating the results of the recursive merge.

An alternative way of fixing this is to `buffer` out a small distance, so that these points of numerical instability become rarer (although we can't guarantee to get rid of them entirely). This seems to be much more stable in practice. This patch detects failure of the merge and tries again with a small (1/16th-pixel) buffer. If that also fails, then it bails and returns an empty result set. The reasoning behind that is that I think we'd prefer to have a tile, even if it's missing some data, than not have a tile at all.
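The fallback described above can be sketched roughly as follows. This is a minimal illustration using Shapely's `unary_union` and `buffer`; the function name and the list return value are hypothetical, not the patch's actual code:

```python
from shapely.geometry import Point
from shapely.ops import unary_union


def merge_polygons(polygons, tolerance):
    """Union a list of polygons, retrying with a slightly buffered
    copy when GEOS hands a null geometry back to Shapely."""
    try:
        return unary_union(polygons)
    except ValueError:
        # don't buffer by the full pixel; a 1/16th-pixel buffer
        # shouldn't be noticeable in the rendered tile.
        buffer_size = tolerance / 16.0
        try:
            return unary_union([p.buffer(buffer_size) for p in polygons])
        except ValueError:
            # better to emit a tile missing some data than no tile at all.
            return []


# usage: merging two overlapping discs into a single polygon
merged = merge_polygons(
    [Point(0, 0).buffer(1.0), Point(1, 0).buffer(1.0)],
    tolerance=0.1,
)
```

The second `except` clause mirrors the patch's "bail and return an empty result set" behaviour rather than letting the error propagate and lose the whole tile.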
Looks good, with one possible efficiency catch.
```python
# don't buffer by the full pixel, instead choose a smaller value that
# shouldn't be noticeable.
buffer_size = tolerance / 16.0
```
Would it make sense to pre-divide the tolerance value so that python doesn't have to do floating point divisions for every pass through here?
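The suggestion here is essentially to hoist the division out of the hot path by computing it once at construction time. A hypothetical sketch, assuming the `RecursiveMerger` and `tolerance_16th` names discussed below:

```python
class RecursiveMerger:
    def __init__(self, tolerance):
        self.tolerance = tolerance
        # precompute once so each merge attempt avoids
        # repeating the floating point division
        self.tolerance_16th = tolerance / 16.0
```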
That's a good idea, thanks.
This method is called pretty dynamically; the caller can configurably call either this, `_merge_polygons_with_buffer`, or `_merge_lines`, and they all do different things with `tolerance`. We could have a `tolerance` and a `tolerance_16th` stored in the `RecursiveMerger` object, but that seems a little inelegant.
Also, I think the time saved by not repeating the division is probably dwarfed by the attempt (and failure, to get to this branch) to `unary_union` the list of polygons. For example, the `11/1052/672` tile that was failing had over 4,000 polygons in that list!
What do you think?
I was thinking this was called more frequently. You're right – let's keep it clean and as-is.