Local country code lookup #6941

quincylvania · 2019-10-15T11:48:44Z

iD is and will be relying more and more on location-aware behavior. See #6513, #6479, #6712, #6836, and #6713, for example.

Right now we call out to Nominatim every time we need the country code for a pair of coordinates. It'd be much more efficient and reliable to do this synchronously by querying a data file bundled with iD.

The trade-off here is that the size of the file would not be trivial. But since iD doesn't require high-precision results, we could generalize the data considerably.

See previous discussion on this topic in the OSMUS Slack.

1ec5 · 2019-10-16T04:50:44Z

In addition to size, we should also keep an eye on runtime performance, considering that a single changeset can straddle a national border or jump around to different parts of the world. For example, which-polygon is very efficient for point-in-polygon lookups, but its memory usage is very sensitive to the complexity of the country polygons, and the time a lookup takes may be as well.

The discussion in Slack points to Natural Earth as a possible source for the country geometries, but I don’t think we should use it as-is. For the features listed above, iD needs relatively high resolution along land borders but very low resolution along coastlines. For example, it’d be a good idea for the local lookup to unify Canada into a single polygon that includes all its islands. However, all of Detroit needs to be on the American side of the border and all of Mexicali on the Mexican side, with a tolerance of tens of meters perhaps, but not kilometers. A simple Douglas–Peucker simplification of the entire shapefile would result in the wrong address format and wrong language being preferred in neighborhoods on either side of the border.
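To illustrate the concern, here is a minimal Python sketch of Douglas–Peucker simplification applied to a made-up border line. The coordinates, tolerances, and function names are hypothetical and chosen only for illustration; this is not code from iD or from any library mentioned in this thread, and it treats coordinates as planar.

```python
import math


def perpendicular_distance(p, a, b):
    """Distance from point p to the infinite line through a and b (planar units)."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    length = math.hypot(dx, dy)
    if length == 0:
        # a and b coincide, so fall back to point-to-point distance
        return math.hypot(px - ax, py - ay)
    return abs(dy * (px - ax) - dx * (py - ay)) / length


def douglas_peucker(points, tolerance):
    """Drop vertices that lie within `tolerance` of the simplified line."""
    if len(points) < 3:
        return list(points)
    # Find the vertex farthest from the chord joining the two endpoints.
    index, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = perpendicular_distance(points[i], points[0], points[-1])
        if d > dmax:
            index, dmax = i, d
    if dmax <= tolerance:
        # Every interior vertex is "close enough": keep only the endpoints.
        return [points[0], points[-1]]
    # Otherwise keep the farthest vertex and recurse on both halves.
    left = douglas_peucker(points[:index + 1], tolerance)
    right = douglas_peucker(points[index:], tolerance)
    return left[:-1] + right


# Hypothetical land border with a 0.05-unit jog around a border town.
border = [(0.0, 0.0), (1.0, 0.0), (1.0, 0.05), (2.0, 0.05), (3.0, 0.0)]

coarse = douglas_peucker(border, 0.1)   # jog is flattened away
fine = douglas_peucker(border, 0.01)    # jog survives
```

With the coarse tolerance the jog disappears, so any neighborhood inside it would be assigned to the wrong side. A single global tolerance cannot be loose for coastlines and tight for land borders at the same time.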

Geofabrik’s data extract polygons are a good example of generalizing coastlines while retaining detail in land boundaries.

quincylvania · 2019-10-16T07:58:27Z

@1ec5 I totally agree. Thankfully file size and point-in-polygon performance correlate, so we can optimize for both. The raw Natural Earth dataset is much too detailed for this use case, even at 110m resolution. Coastline generalization should be a primary strategy, where islands like Iceland can be represented as simple rectangles or even triangles. For our purposes we don't need to know if a point is on land or not.
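As a rough illustration of how far coastline generalization could go, the Python sketch below replaces an island's outline with its bounding rectangle. The ring coordinates are invented and the function name is hypothetical; it only shows the idea and is not how any existing dataset was produced.

```python
def bounding_rectangle(ring):
    """Return a closed 5-vertex rectangle ring that covers every vertex of `ring`."""
    xs = [x for x, _ in ring]
    ys = [y for _, y in ring]
    minx, miny, maxx, maxy = min(xs), min(ys), max(xs), max(ys)
    return [(minx, miny), (maxx, miny), (maxx, maxy), (minx, maxy), (minx, miny)]


# Invented island outline (closed ring, first vertex repeated at the end).
island = [
    (0.0, 1.0), (0.5, 0.2), (1.5, 0.0), (2.5, 0.4),
    (3.0, 1.2), (2.2, 2.0), (1.0, 1.8), (0.0, 1.0),
]

rectangle = bounding_rectangle(island)
```

This only makes sense for features with no land borders and no close neighbors, since the rectangle also covers surrounding water and could overlap another country's polygon.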

I was also thinking this would make for a good external module that other apps could also use.

bhousel · 2019-10-16T14:01:12Z

This is a great idea, definitely something that's been on our radar for a while, and something I'd use in a bunch of projects.

The closest thing we have right now is in the osm-community-index, which includes a bunch of country-level polygons, but also a bunch of other smaller ones. You can browse the osm-community-index data here on this nice map that @mikelmaron made: https://mikelmaron.github.io/map-demos/osm-community-index/

The polygon data by itself comes out to 238k minified. We are already using which-polygon in iD to index this data and also the editor-layer-index polygons. This approach is very fast because it precalculates bounding boxes and stores them in an rbush, so it's only really doing the point-in-polygon tests for the polygons with bounds that actually intersect the point.
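The bounding-box prefilter idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class and function names are hypothetical, the two square "countries" are made up, and a plain linear scan over the boxes stands in for the R-tree, so it is not the which-polygon or rbush implementation.

```python
def bbox(ring):
    """Bounding box of a ring as (minx, miny, maxx, maxy)."""
    xs = [x for x, _ in ring]
    ys = [y for _, y in ring]
    return (min(xs), min(ys), max(xs), max(ys))


def point_in_ring(point, ring):
    """Ray-casting test: count how many edges a ray going in +x crosses."""
    x, y = point
    inside = False
    j = len(ring) - 1
    for i in range(len(ring)):
        xi, yi = ring[i]
        xj, yj = ring[j]
        crosses = (yi > y) != (yj > y)
        if crosses and x < (xj - xi) * (y - yi) / (yj - yi) + xi:
            inside = not inside
        j = i
    return inside


class PolygonIndex:
    """Hypothetical index: boxes are precomputed once, at construction time."""

    def __init__(self, features):
        # features maps a code to a single outer ring (no holes, for brevity)
        self.entries = [(code, bbox(ring), ring) for code, ring in features.items()]
        self.tests_run = 0  # number of full point-in-polygon tests performed

    def lookup(self, point):
        x, y = point
        for code, (minx, miny, maxx, maxy), ring in self.entries:
            if not (minx <= x <= maxx and miny <= y <= maxy):
                continue  # cheap rejection, no polygon test needed
            self.tests_run += 1
            if point_in_ring(point, ring):
                return code
        return None


# Two made-up square "countries" far apart from each other.
index = PolygonIndex({
    "AA": [(0, 0), (2, 0), (2, 2), (0, 2)],
    "BB": [(10, 10), (12, 10), (12, 12), (10, 12)],
})
code = index.lookup((11, 11))  # only "BB" gets a full polygon test
```

The expensive ray-casting loop runs only for polygons whose box contains the point, which is why simpler polygons and tighter boxes both help lookup time.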

There are obviously some seams and places where we could improve a bunch on this. Part of the issue is that each GeoJSON has been added independently by different contributors. An editor like iD, but built specifically for generating a boundary mesh, would be nice because then we could snap points together.

A handful of countries make the index much larger because of their complex borders. This is not intuitive: Russia and France have about equally complicated borders, while Canada and the US are less than half as complex. I tend to simplify a lot in sparsely populated areas. Not all of these have been hand-edited, so there is a lot of room for improvement.
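One rough way to see which polygons dominate the file is to count vertices per feature. The Python sketch below assumes standard GeoJSON `Polygon` and `MultiPolygon` geometries; the function name and the sample geometries are hypothetical, and this is not the stats command from osm-community-index.

```python
def count_vertices(geometry):
    """Total number of coordinate pairs in a GeoJSON Polygon or MultiPolygon."""
    if geometry["type"] == "Polygon":
        polygons = [geometry["coordinates"]]
    elif geometry["type"] == "MultiPolygon":
        polygons = geometry["coordinates"]
    else:
        raise ValueError("unsupported geometry type: " + geometry["type"])
    # Each polygon is a list of rings; each ring is a list of positions.
    return sum(len(ring) for polygon in polygons for ring in polygon)


# Made-up geometries: one square, and a square plus a triangle.
square = {
    "type": "Polygon",
    "coordinates": [[[0, 0], [1, 0], [1, 1], [0, 1], [0, 0]]],
}
pair = {
    "type": "MultiPolygon",
    "coordinates": [
        [[[0, 0], [1, 0], [1, 1], [0, 1], [0, 0]]],
        [[[5, 5], [6, 5], [5, 6], [5, 5]]],
    ],
}
sizes = {"square": count_vertices(square), "pair": count_vertices(pair)}
```

Sorting such counts in descending order would point at the features where hand simplification pays off most.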

There is also a stats command so I can keep track of the polygon sizes.

So.. My approach to doing this right would be:

1. Make an iD fork that is specifically for editing GeoJSON.
2. Use that to edit and refine the country mesh.

I'm working slowly towards laying the foundation that would let us do 1.

don-vip · 2019-10-16T21:59:43Z

You can also reuse the JOSM boundaries file: https://josm.openstreetmap.de/export/HEAD/josm/trunk/data/boundaries.osm (1.8 MB in .osm format, 5.4 MB in GeoJSON format). It contains all countries, plus subdivisions for the US, Canada, India and China.

See https://josm.openstreetmap.de/log/josm/trunk/data/boundaries.osm for the list of fixed issues since I introduced it 3 years ago.

quincylvania · 2019-10-17T10:11:41Z

@don-vip Thanks so much for the link! That's a great help, I think we'll be able to use it as a starting point.

🏎💨

quincylvania · 2019-10-23T10:12:45Z

Update: I've been working on this for the past week or so. Check out the package repo: https://github.com/ideditor/country-coder

quincylvania added the chore label Oct 15, 2019

quincylvania self-assigned this Oct 22, 2019

quincylvania mentioned this issue Oct 29, 2019

Warn if a brand tag is used in a non-matching country/region #6989

Open

quincylvania added this to the 2.16.1 milestone Nov 1, 2019

quincylvania added a commit that referenced this issue Nov 1, 2019

Add country-coder as a dependency

51dbdb4

Use country-coder to code addresses (re: #6941)

quincylvania closed this as completed in 8c07401 Nov 1, 2019

quincylvania added a commit that referenced this issue Nov 1, 2019

Use country-coder in v3-exclusive code (re: #6941)

bd52e0f

quincylvania added a commit that referenced this issue Nov 8, 2019

Replace mph.json file with country-coder implementation (re: #6941)

e8e95c6

quincylvania mentioned this issue Nov 11, 2019

Graceful degredation of fields when services like nominatim are unavailable #4198

Closed

quincylvania mentioned this issue Dec 3, 2019

Allow limiting fields to specific countries/regions #7085

Closed

quincylvania mentioned this issue Dec 23, 2019

Update to iD v2.17.0 openstreetmap/openstreetmap-website#2474

Merged

bhousel mentioned this issue Dec 9, 2020

GeoJSON for states osmlab/name-suggestion-index#4784

Closed
