Don't preflight zoom levels for potential as-needed dropping #40
Conversation
In #3 I write all of the tiles to a tempfile (protected by a mutex) as they're generated, and also add an entry (zxy, offset/len) to a vector. It sounds like I would need to slightly revise this by using a std::map for the entries so I can overwrite an entry if the zoom level is rewritten. That means there would be dead, unreferenced data in the tempfile, which would be skipped over when the tempfile is turned into the final archive. Does that approach sound reasonable? One concern is that if entire deep zoom levels are being re-done, it will leave a massive amount of dead data in the tempfile, possibly consuming much more disk space than the final archive.
@bdon Can you truncate your temporary file to the length it was at the end of the last successfully-generated zoom level, since it will always be entire zoom levels that are retried? But if that isn't possible, I think it is OK to leave unreferenced data in a temporary file, even though it might add up.
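The truncation suggestion can be sketched with POSIX `lseek`/`ftruncate`: checkpoint the file length after each fully successful zoom level, and on a retry cut the file back to that checkpoint. Function names here (`checkpoint`, `rewind_to`) are illustrative, not from either project.

```cpp
#include <cstdio>
#include <unistd.h>

// Record the tempfile length after a zoom level fully succeeds.
long long checkpoint(int fd) {
    return lseek(fd, 0, SEEK_END);
}

// On retry, discard everything written after the checkpoint so the
// tempfile never accumulates dead bytes from abandoned attempts.
bool rewind_to(int fd, long long good_len) {
    if (ftruncate(fd, good_len) != 0) return false;
    return lseek(fd, good_len, SEEK_SET) == good_len;
}
```

As the comment below notes, this only works if every writer thread has quiesced before the truncation, since a thread still appending past the checkpoint would have its offsets invalidated.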
Thanks for the detailed explanation!
I opened #43 as a simpler alternative to having PMTiles as a sibling output. Truncating the tempfile sounds tricky with multiple threads involved, so I'd prefer the approach in #43; otherwise I'd prefer to leave the dead data in the tempfile.
Instead, go ahead and write out the zoom level, and if any of the tiles didn't work, clear out the zoom level and do it again. This should make the common case, where the data fits and nothing has to be dropped as-needed, much faster.
Tippecanoe previously would preflight each zoom level to determine whether all of its tiles were going to be small enough, or, if they weren't, what feature-dropping or -coalescing threshold would allow the entire zoom level to succeed, before doing another pass through the zoom level to actually write out the tiles. Now it writes out the tiles on the first pass, and only if it actually needs to change something to make the zoom level work does it erase the zoom level from the mbtiles output and try it again.
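The control flow described above can be sketched as an optimistic retry loop. Everything here is a stand-in for illustration: tile sizes are given as plain byte counts, and each retry step pretends that a more aggressive drop threshold halves every tile. Tippecanoe's real feature-dropping logic is far more involved.

```cpp
#include <vector>
#include <cstddef>

// Hypothetical sketch of the optimistic strategy: write the zoom level
// assuming no dropping is needed; only if some tile exceeds the size
// limit, discard the zoom level and retry with more aggressive dropping.
// Returns the number of retries (drop-threshold steps) that were needed.
int write_zoom_with_retries(const std::vector<size_t> &tile_sizes,
                            size_t max_bytes) {
    int drop_step = 0;  // 0 = no dropping, the common fast path
    for (;;) {
        bool ok = true;
        for (size_t s : tile_sizes) {
            size_t shrunk = s >> drop_step;  // pretend each step halves tiles
            if (shrunk > max_bytes) {
                ok = false;  // this tile would be too big
                break;
            }
            // write_tile(shrunk);  // in the common case, the only pass
        }
        if (ok) return drop_step;
        // erase_zoom();  // clear the partial zoom level and redo it
        drop_step++;
    }
}
```

The old preflight behavior always paid for two passes; this pays for one pass whenever `drop_step` stays 0, which matches the timings below for the no-dropping case.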
Sample time savings (no features being dropped):

```
tippecanoe --no-tile-size-limit --no-feature-limit --drop-densest-as-needed tl_2021_06037_roads.shp.json
before: 56.25s user 3.31s system 239% cpu 24.825 total
after:  31.96s user 2.10s system 196% cpu 17.338 total
```

Sample time savings (features being dropped from some zoom levels):

```
tippecanoe --drop-densest-as-needed tl_2021_06037_roads.shp.json
before: 62.82s user 3.37s system 208% cpu 31.732 total
after:  49.42s user 2.72s system 190% cpu 27.440 total
```

No intended change without as-needed:

```
tippecanoe --no-tile-size-limit --no-feature-limit tl_2021_06037_roads.shp.json
before: 31.35s user 1.95s system 213% cpu 15.587 total
after:  30.83s user 2.06s system 224% cpu 14.622 total
```