-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Devise a way to identify multiple sites on the same JUROL that have the same civic number but different street names #56
Comments
To even start, this will require us to resurrect a way to load the current data formats used in the now database-free geocoder environment into a database. |
Ok, will investigate ways of scripting this internally without a db. |
Nitpicky but important: they do not share the same "accessPoint". Their accessPoints are at the same location. I guess its fair to say they share the same "access point" (note the space and lack of camelCase). I make these accessPoints in the rangeGen, we don't really need to find them after the fact. They end up at the same location because they are "off the end" of the Street's centerline linestring. Certainly, MOST accessPoints that are offset from the end-point of the Street's lineSegment probably do not belong there. |
I suspect many of them are actually legitimate where a street ends in a cul-de-sac. I have a semi-working database environment in postgis that can ultimately be used to load the new data formats; it still needs more tweaking. About 10% of all APs share the same coordinate with at least one other AP (also included restriction to have same locality ID and street segment ID for a fast response). Note that subsites currently don't have their own APs because custodians don't supply it, so the counts are related to sites only. About 3500 records comprise the set of APs where 50 or more of them have a shared location. SELECT access_point_id, site_id, ap.locality_id, ap.street_segment_id, civic_number, count, ap.geometry FROM |
Perhaps restricting to APs that share the same point with three or more APs would narrow it down a bit. |
Cuts the number of cases in half, but there are still way too many to be useful for review. I could try cross-referencing to and ignoring dead end street cases, but this may still not be good enough. The worst offender is 143 APs at the same location around 1822 Purcell Way, North Vancouver. They are mid-block because all the custodian records for this block complex have the same site location. It isn't a dead end case, they just all extrapolate to the same mid-block location from their common starting geometry. |
Here's a revised query that ignores cases on dead-end segments. It cuts the number of cases in half, but with over 60k results is still too excessive for a broad review. As previously noted, looking for the largest clusters may provide the best starting point. SELECT access_point_id, site_id, ap.locality_id, ap.street_segment_id, civic_number, count, ap.geometry FROM |
Issue moved to bcgov/ols-geocoder #35 via ZenHub |
One way is to find accessPoints that are associated with more than one address. You need to exclude sub-units. For example, 1004, 1008,1104,1108, 1100 Pavillion-Clinton Rd, Clinton all share the same accessPoint
The text was updated successfully, but these errors were encountered: