DISCUSSION: What open datasets do you want in Pelias? #254
Comments
For reference, here are the datasets we currently use:
|
Also here are a few open sources we're tracking now. No promises if/when they'll all make it into the service (and if it makes sense to route them through other, broader projects, like OpenAddresses or Who's on First), but we want to be transparent in what we've been thinking about.
|
It's GB postal codes. The Northern Ireland postcodes are managed by https://www.dfpni.gov.uk/publications/lps-copyright-licensing-digital-application-forms (kind of a separate Ordnance Survey). They resell Royal Mail data and require a commercial license. I know, pretty annoying, especially since you can download the NI postcodes just fine. For Canadian postcodes, watch http://geocoder.ca/?sued=1#updates (and/or add to their legal defense fund) |
Totally, thank you for catching the NI issue, @mtmail. And we'll keep our eyes on geocoder.ca. |
I would like to propose that we build an importer for All The Places. cc @iandees |
Hi @orangejulius, I noticed your Docker project for Australia. The Australian Government also makes a Geocoded National Address File each quarter which may be useful to develop an importer for. |
Hey @hughcameron, Pelias uses OpenAddresses, which contains that curated Australia-wide source, kept up-to-date by the inimitable @andrewharvey. Thanks for your interest in Pelias, we love new data sources! |
@hughcameron, yup, what @trescube said. The excellent coverage via G-NAF data is part of the reason I made the Australia Docker project. We plan to use Australia as a mid-sized build for testing pelias going forward because we know there is great address coverage. If you have a chance to try it out let me know! |
Hey @orangejulius & @trescube - thanks for letting me know. I got the Australia project up and running earlier in the week using forward geocoding for validation and parsing - really powerful stuff. Great work! |
I suggest two or three public sources: |
I think OpenAddresses would be happy to have this source: https://opendata-ajuntament.barcelona.cat/data/ca/dataset/taula-direle |
Feel free to open a PR! |
In France, there is also the official https://adresse.data.gouv.fr/ |
|
It would be nice to have an official importer for GTFS data. |
I noticed that the Canadian postcodes are not really good in Pelias yet. This company is making monthly updated Canadian postal code data (890,000 entries right now) available for free: https://www.serviceobjects.com/blog/unique-us-canadian-zip-code-files-available-download/ This would make Canadian address data much more accurate. Is the census data from here used for Canada already? |
Hi @gelsas, It appears that the license for that zip file from ServiceObjects is effectively proprietary, and certainly not suitable for inclusion by default in Pelias. It's likely anyone using Pelias for their own needs wouldn't want to use it either. |
Is this included in Pelias? (This is a really important question for me, since I am considering setting this up: https://github.com/mountain-pass/addressr if the data is not yet in Pelias) |
Are these datasets already used in Pelias: Is NZ data included already? |
Hi @gelsas, The G-NAF dataset from Australia, New Zealand's countrywide address dataset (and several others specific to particular cities or regions), and the Icelandic address registry are all supported out of the box by Pelias through the wonderful OpenAddresses project. The OpenAddresses folks are pretty diligent about adding new public, appropriately licensed address datasets, so they'll generally have anything like that. You can always check in their sources list and if you do happen to find one that's missing they would definitely want to know :) |
Pelias can only be as good as the data available to it. And so far, we've done a lot to make the best use of the best collaborative open data projects we can find. But there are plenty of other sources of data that have the potential to open geocoding up to parts of the world where we don't yet offer it.
We suspect you probably already know some of the best sources that would make Pelias/Mapzen Search substantially better. Please leave us a comment below with a particular target dataset that you'd like to see Pelias/Mapzen Search rely upon.
We'd love to see new sources of:
This is why we think open will win: when people can share what we know, knowledge grows faster and we can build so much more.