
Parser optimization: Separate filtering and property handling and the two passes in OSMReader #49

Closed
NopMap opened this issue May 25, 2013 · 3 comments

Comments

@NopMap
Contributor

NopMap commented May 25, 2013

Currently, the filtering for routable ways and the handling of more detailed properties are mixed together in AcceptWays. The code is also executed twice, once during each pass. This will get worse as the parsing becomes more detailed.

Suggestion:

  • separate filtering and property analysis into two methods
  • run filtering only in pass 1
  • run property analysis only in pass 2
  • store the result of filtering from pass 1 and re-use it in pass 2

The result of the filtering should be stored per way, with one bit for each allowed mode of travel, and re-used in pass 2 to skip the redundant filtering. This would speed up parsing at the expense of memory. (optional switch?)
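A minimal sketch of the per-way bitmask idea, assuming an in-memory map from way ID to a mode mask (the class and mode names here are illustrative, not actual GraphHopper API):

```java
import java.util.HashMap;
import java.util.Map;

// Sketch: pass 1 records one bit per allowed travel mode for each
// accepted way; pass 2 reads the cached mask instead of re-running
// the filter. Names (WayAcceptanceCache, CAR/BIKE/FOOT) are invented
// for illustration.
public class WayAcceptanceCache {
    // bit positions for the modes of travel (illustrative)
    public static final int CAR  = 1;       // bit 0
    public static final int BIKE = 1 << 1;  // bit 1
    public static final int FOOT = 1 << 2;  // bit 2

    private final Map<Long, Byte> acceptedWays = new HashMap<>();

    // pass 1: store the filtering result for a way (rejected ways are
    // simply not stored, which keeps the map small)
    public void accept(long wayId, int modeMask) {
        if (modeMask != 0)
            acceptedWays.put(wayId, (byte) modeMask);
    }

    // pass 2: look up the cached result instead of re-filtering
    public int acceptedModes(long wayId) {
        Byte mask = acceptedWays.get(wayId);
        return mask == null ? 0 : mask & 0xFF;
    }

    public boolean isAccepted(long wayId, int mode) {
        return (acceptedModes(wayId) & mode) != 0;
    }
}
```

Storing the mask as a single byte per accepted way keeps the memory overhead small, though on a planet-sized import even one entry per way adds up, which is the trade-off discussed below.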

@karussell
Member

This would speed up parsing at the expense of memory.

I don't like this idea. For me, RAM is more important than speed: e.g. if you want to import planet.osm you already need 20 GB, and this is increasing by 40% per year due to OSM contributions! So before increasing the RAM usage, we should make a fast on-disk solution possible for GHLongIntBTree. But I didn't find a good solution; MapDB, for example, was too slow. I'll experiment with Lucene again, as we probably need it elsewhere anyway.

Of course, you can make this speed-up optional, e.g. in a new OSMReaderHelper subclass or whatever.

@NopMap
Contributor Author

NopMap commented May 25, 2013

It would probably be worthwhile to have a technology that runs in limited memory for any size of file. In my map compiler I had a similar problem caching the nodes. I wrote my own storage class that is organized in pages loaded from disk, with a cap on how many pages can be loaded at one time.

Until then, it can't hurt to make this optional, controlled by a property. If no cached data is available, the ways are parsed again just as happens now, so nothing is lost. If you have enough memory, it should run faster.
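The paged storage class described above could be sketched roughly like this, assuming fixed-size pages and LRU eviction of the least-recently-used page; the names are illustrative and the disk I/O is stubbed out with comments:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch: node values live in fixed-size pages; at most maxLoadedPages
// are kept in memory, and the least-recently-used page is evicted when
// the limit is exceeded. A real implementation would write evicted
// pages to disk and read them back on demand; this stub drops them.
public class PagedNodeStore {
    private final int pageSize;
    private final LinkedHashMap<Long, long[]> pages;

    public PagedNodeStore(int pageSize, final int maxLoadedPages) {
        this.pageSize = pageSize;
        // an access-ordered LinkedHashMap gives simple LRU eviction
        this.pages = new LinkedHashMap<Long, long[]>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<Long, long[]> eldest) {
                // real implementation: flush eldest.getValue() to disk here
                return size() > maxLoadedPages;
            }
        };
    }

    private long[] page(long nodeId) {
        long pageId = nodeId / pageSize;
        // real implementation: re-read the page from disk if it was evicted
        return pages.computeIfAbsent(pageId, id -> new long[pageSize]);
    }

    public void put(long nodeId, long value) {
        page(nodeId)[(int) (nodeId % pageSize)] = value;
    }

    public long get(long nodeId) {
        return page(nodeId)[(int) (nodeId % pageSize)];
    }
}
```

With the page limit as a configurable property, the same store could scale from an all-in-memory fast path down to a mostly-on-disk mode for planet-sized imports.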

@NopMap
Contributor Author

NopMap commented May 27, 2013

This can be closed.
