Skip to content

Improve handling of multiple input files#1339

Merged
lonvia merged 1 commit intoosm2pgsql-dev:masterfrom
joto:multi-file-input
Nov 26, 2020
Merged

Improve handling of multiple input files#1339
lonvia merged 1 commit intoosm2pgsql-dev:masterfrom
joto:multi-file-input

Conversation

@joto
Copy link
Copy Markdown
Collaborator

@joto joto commented Nov 25, 2020

Osm2pgsql can handle any number of input files. The old code will just
read the files one after the other which will not work if there is any
overlap between the files, i.e. if the same object is in two input
files.

The new code will read the files in parallel. We construct a priority
queue feeding in the next objects from all input files, taking off the
"smallest" one by one. If the same object is in multiple files, we
only process it once.

If there is only a single input file a shortcut is taken which basically
behaves like the old code.

Note that the input files have to be from the same point in time. If
there are multiple versions of the same object in the input, this will
still not magically work.

This commit removes support for unsorted input files which were already
deprecated.

See #1167

This commit removes support for negative ids which were already
deprecated.

Fixes #1097

Osm2pgsql can handle any number of input files. The old code will just
read the files one after the other which will not work if there is any
overlap between the files, i.e. if the same object is in two input
files.

The new code will read the files in parallel. We construct a priority
queue feeding in the next objects from all input files, taking off the
"smallest" one by one. If the same object is in multiple files, we
only process it once.

If there is only a single input file a shortcut is taken which basically
behaves like the old code.

Note that the input files have to be from the same point in time. If
there are multiple versions of the same object in the input, this will
still not magically work.

This commit removes support for unsorted input files which were already
deprecated.

See osm2pgsql-dev#1167

This commit removes support for negative ids which were already
deprecated.

Fixes osm2pgsql-dev#1097
@lonvia lonvia merged commit ace7d22 into osm2pgsql-dev:master Nov 26, 2020
@joto joto deleted the multi-file-input branch November 27, 2020 07:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Deprecate use of negative IDs

2 participants