
Optimize ST_Simplify (and family) #480

Closed
wants to merge 42 commits

Conversation

@Algunenano Algunenano commented Sep 23, 2019

First the benchmarks:

  • Dataset: 13 multipolygons with an average of 260843.07 points per geometry.
  • Method: number of iterations completed by pgbench running the same query with tolerances 1M, 1, and 0, each for 20 seconds.

Trunk:

  • Tolerance 1000000: 302 its / 20 seconds.
  • Tolerance 1: 39 its / 20 seconds.
  • Tolerance 0: 39 its / 20 seconds.

PR:

  • Tolerance 1000000: 612 its / 20 seconds: 2x as fast as trunk
  • Tolerance 1: 93 its / 20 seconds: 2.38x as fast as trunk
  • Tolerance 0: 107 its / 20 seconds: 2.74x as fast as trunk

Now the meat:
There is no algorithm change behind this speedup; at the end of the day Douglas-Peucker is still Douglas-Peucker. So how did I do it? (Ordered by impact):

  • ptarray_dp_findsplit_in_place now contains the code of distance2d_sqr_pt_seg inlined manually to avoid recalculations. As the segment AB is always the same during the loop, caching as many intermediate results as possible yields a big performance win. It also includes some tricks the compiler isn't clever enough to apply on its own (using AB == -BA, avoiding the division, and A/B > 1 == A > B when both are positive).
  • ptarray_simplify_in_place has been rewritten to use an array of bools instead of the outlist + sort (see the sketch after this list). The conditions and input for ptarray_dp_findsplit_in_place have also changed to, IMO, make the intent clearer. It now uses memcpy (without checking for i != j), which is slightly better but only noticeable after the rest of the improvements.
  • lwgeom_simplify_in_place now returns an int indicating whether the geometry has been modified and drops the bbox if it has, which allows us to avoid serialization in some cases.
  • lwgeom_simplify_in_place now stops simplifying a polygon once a ring is dropped, as according to the spec any inner ring should be smaller than the ring that was just dropped.
  • ST_Simplify now clones the gserialized input and calls lwgeom_simplify_in_place instead of getting a pointer and cloning the geometry, which was slower.

Functions affected by these changes: ST_Simplify directly and ST_Subdivide indirectly.
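
To make the ptarray_simplify_in_place bullet more concrete, here is a minimal, self-contained sketch of the keep-mask approach. This is not the PostGIS code: pt2d and find_split are hypothetical stand-ins for the real point type and split search, and error handling is omitted.

```c
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical point type and split search, standing in for the PostGIS ones.
 * find_split() is assumed to return the index of the point strictly between
 * i1 and i2 that is farthest from the segment pts[i1]-pts[i2], writing its
 * squared distance to *d2, or to return i1 when there are no interior points. */
typedef struct { double x, y; } pt2d;

extern uint32_t find_split(const pt2d *pts, uint32_t i1, uint32_t i2, double *d2);

static uint32_t
simplify_in_place(pt2d *pts, uint32_t n, double tolerance)
{
	double tol_sqr = tolerance * tolerance;
	bool *keep;                           /* survivor mask, replaces outlist + sort */
	uint32_t (*stack)[2];                 /* pending (start, end) index pairs */
	uint32_t sp = 0, i, j;

	if (n < 3)
		return n;

	keep = calloc(n, sizeof(*keep));      /* error handling omitted in the sketch */
	stack = malloc(n * sizeof(*stack));   /* n pairs is a safe upper bound */

	keep[0] = keep[n - 1] = true;         /* endpoints always survive */
	stack[sp][0] = 0;
	stack[sp][1] = n - 1;
	sp++;

	while (sp > 0)
	{
		double d2 = 0.0;
		uint32_t i1, i2, split;

		sp--;
		i1 = stack[sp][0];
		i2 = stack[sp][1];

		split = find_split(pts, i1, i2, &d2);
		if (split == i1 || d2 <= tol_sqr)
			continue;                     /* every point strictly inside (i1, i2) is dropped */

		keep[split] = true;               /* keep the farthest point, recurse on both halves */
		stack[sp][0] = i1;    stack[sp][1] = split; sp++;
		stack[sp][0] = split; stack[sp][1] = i2;    sp++;
	}

	/* Compact in place: the kept indexes are already in ascending order. */
	for (i = 0, j = 0; i < n; i++)
	{
		if (!keep[i])
			continue;
		memcpy(&pts[j], &pts[i], sizeof(pt2d));   /* unconditional copy, as in the PR */
		j++;
	}

	free(stack);
	free(keep);
	return j;                             /* new point count */
}
```

The compaction pass at the end is what replaces the old outlist + sort: since the survivors are marked in index order, a single forward copy over the array is enough.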

Other stuff:

  • distance2d_sqr_pt_seg: I've applied the same tricks as in ptarray_dp_findsplit_in_place where possible, and removed distance2d_sqr_seg to force callers to use this other, faster function and only calculate the square root when necessary (which was almost never). A sketch of this style of distance calculation follows below.
    Functions (indirectly) affected by this: ST_Split, ST_Node, ST_OffsetCurve.
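
To illustrate the distance tricks (both this point and the first bullet of the main list), here is a rough sketch of a squared point-to-segment distance written in that style. These are hypothetical names, not the actual PostGIS functions: no square root is taken, the r < 0 / r > 1 checks compare the raw dot product against 0 and |AB|² instead of dividing, and the segment-only terms are computed once by the caller and reused for every point tested against the same segment.

```c
#include <stdint.h>

/* Hypothetical names, not the PostGIS functions.  Squared distance from P to
 * segment A-B: no sqrt, and the r < 0 / r > 1 checks are done on the raw dot
 * product instead of dividing by |AB|^2.  dx, dy and ab_len_sq depend only on
 * A and B, so the caller computes them once per segment. */
typedef struct { double x, y; } pt2d;

static double
pt_seg_dist_sqr(const pt2d *p, const pt2d *a,
                double dx, double dy, double ab_len_sq)
{
	double apx = p->x - a->x;
	double apy = p->y - a->y;
	double dot = apx * dx + apy * dy;       /* equals r * |AB|^2 */

	if (dot <= 0.0)                         /* r <= 0: projection falls before A */
		return apx * apx + apy * apy;       /* distance to A (also covers A == B) */

	if (dot >= ab_len_sq)                   /* r >= 1: projection falls past B */
	{
		double bpx = apx - dx;              /* P - B == (P - A) - (B - A) */
		double bpy = apy - dy;
		return bpx * bpx + bpy * bpy;       /* distance to B */
	}

	{
		double cross = apx * dy - apy * dx; /* twice the area of triangle ABP */
		return (cross * cross) / ab_len_sq; /* squared perpendicular distance */
	}
}

/* Typical caller: A-B is fixed while scanning the interior points, so the
 * segment terms are hoisted out of the loop. */
static uint32_t
farthest_from_segment(const pt2d *pts, uint32_t i1, uint32_t i2, double *d2_out)
{
	double dx = pts[i2].x - pts[i1].x;
	double dy = pts[i2].y - pts[i1].y;
	double ab_len_sq = dx * dx + dy * dy;
	double max_d2 = -1.0;
	uint32_t split = i1, i;

	for (i = i1 + 1; i < i2; i++)
	{
		double d2 = pt_seg_dist_sqr(&pts[i], &pts[i1], dx, dy, ab_len_sq);
		if (d2 > max_d2)
		{
			max_d2 = d2;
			split = i;
		}
	}
	*d2_out = max_d2;
	return split;
}
```

The dot <= 0 and dot >= ab_len_sq checks are the "A/B > 1 == A > B when both are positive" trick from the description above: they replace computing r = dot / |AB|² and testing it against 0 and 1.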

Note: I haven't cleaned up the commits, so reviewing the changes as a whole is preferable.


pramsey commented Sep 23, 2019

Regarding the performance numbers: cool. How does it work on more "normally" sized data sets (npoints < 100)? Presumably well, because the no-op overhead is lower, but good to confirm?


Algunenano commented Sep 23, 2019

Regarding the performance numbers: cool. How does it work on more "normally" sized data sets (npoints < 100)? Presumably well, because the no-op overhead is lower, but good to confirm?

Testing with a table of 11680 ST_MultiLineString geometries (AVG(ST_NPoints) == 43):

Trunk:

  • Tolerance 1000000: 659 its / 20 seconds.
  • Tolerance 1: 39 its / 20 seconds.
  • Tolerance 0: 38 its / 20 seconds.

PR:

  • Tolerance 1000000: 651 its / 20 seconds: ~Same as trunk
  • Tolerance 1: 95 its / 20 seconds: 2.43x as fast as trunk
  • Tolerance 0: 109 its / 20 seconds: 2.86x as fast as trunk

@Algunenano

Note that in the case where the performance is the same in trunk and with the changes, the share of time spent in lwgeom_simplify_in_place is 6.34%; the rest goes to reading from disk, joining parallel plans, and serializing and deserializing the geometry, so there isn't much for these changes to gain there.


dr-jts commented Sep 23, 2019

Is the goal of this to make MVT generation faster? If so, would a simpler simplification algorithm be better (e.g. simple decimation, or something based on a sliding window)?

@Algunenano

In this case it's about rendering faster: CARTO uses ST_Simplify from Mapnik (PNG and other formats) and also as part of ST_AsMVTGeom, so it's a 2-for-1.

It might be possible to find a better simplification algorithm for MVTs, but the simplification step only has a performance impact in some corner cases (big polygons with high simplification outside the tile box), and I hope the improvements both here and in ST_RemoveRepeatedPoints will reduce them quite a bit, so the focus will again be on the whole clipping + validation step (the slowest one in most cases).


pramsey commented Sep 23, 2019

The simplify is generally preceded by a remove-repeated-points(tolerance), so to some extent that "rough filter" has already been applied.

@Algunenano

The numbers for the line benchmark were looking too familiar, so I rechecked them: I had been using the old table (polygons) for tolerance 1 and 0 😓. Here are the proper numbers:

Trunk:

  • Tolerance 1000000: 685 its / 20 seconds.
  • Tolerance 1: 491 its / 20 seconds.
  • Tolerance 0: 488 its / 20 seconds.

PR:

  • Tolerance 1000000: 701 its / 20 seconds: ~Same as trunk
  • Tolerance 1: 658 its / 20 seconds: 1.34x as fast as trunk
  • Tolerance 0: 694 its / 20 seconds: 1.42x as fast as trunk

At this point, the time spent in the simplification function is ~6%. In fact, the currently high cost of the function is giving us a parallel plan that is slower, and the same happens with ST_RemoveRepeatedPoints. I'm considering reducing both of their costs to LOW (100 vs 10000) for as long as we don't have a system in place to scale the cost based on tuple / geometry size.


Komzpa commented Sep 24, 2019

For tolerance 0, a further optimization is possible: in O(N), loop over the points and check whether the previous one lies on the line between the pre-previous and the current one. If it does, overwrite it with the current point; otherwise, append. This will help Subdivide more, together with MVT.
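
A rough, self-contained sketch of that single pass (hypothetical pt2d type, not PostGIS code; degenerate segments where the pre-previous and current points coincide are not handled specially):

```c
#include <stdint.h>

/* Single O(N) pass for tolerance 0: a point is dropped when it lies exactly
 * on the segment between the previous kept point and the incoming one. */
typedef struct { double x, y; } pt2d;

static uint32_t
drop_collinear_in_place(pt2d *pts, uint32_t n)
{
	uint32_t out = 1;                       /* pts[0..out] are the points kept so far */
	uint32_t i;

	if (n < 3)
		return n;

	for (i = 2; i < n; i++)
	{
		const pt2d a = pts[out - 1];        /* pre-previous (kept) */
		const pt2d b = pts[out];            /* previous, candidate for removal */
		const pt2d c = pts[i];              /* current */
		double acx = c.x - a.x, acy = c.y - a.y;
		double abx = b.x - a.x, aby = b.y - a.y;
		double cross = abx * acy - aby * acx;  /* 0 => a, b, c are collinear */
		double dot = abx * acx + aby * acy;    /* projection of b onto a->c */
		double len_sq = acx * acx + acy * acy;

		if (cross == 0.0 && dot >= 0.0 && dot <= len_sq)
			pts[out] = c;                   /* b sits on segment a-c: overwrite it */
		else
			pts[++out] = c;                 /* keep b and append c */
	}
	return out + 1;                         /* new point count */
}
```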

@strk strk closed this in 86057e2 Sep 24, 2019
Algunenano pushed a commit to Algunenano/postgis that referenced this pull request Oct 2, 2019
Closes #4510
Closes postgis#480

git-svn-id: http://svn.osgeo.org/postgis/trunk@17821 b70326c6-7e19-0410-871a-916f4a2858ee
@Algunenano Algunenano deleted the speed_simplify branch November 15, 2019 15:21