Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DISCUSSION: What open datasets do you want in Pelias? #254

Open
riordan opened this issue Feb 4, 2016 · 20 comments
Open

DISCUSSION: What open datasets do you want in Pelias? #254

riordan opened this issue Feb 4, 2016 · 20 comments

Comments

@riordan
Copy link
Contributor

riordan commented Feb 4, 2016

Pelias can only be as good as the data available to it. And so far, we've done a lot to make the best use of the best collaborative open data projects we can find. But there are plenty of other sources of data that have the potential to open geocoding up to parts of the world where we don't yet offer it.

We suspect you probably know some of the best sources already that would make Pelias/Mapzen Search substantially better. Please, leave us comment below with a particular target dataset that you'd like to see Pelias/Mapzen Search rely upon.

We'd love to see new sources of:

  • Address data (so we can provide it to OpenAddresses)
  • Road network data for interpolation
  • Postcodes
  • Venues and named points of interest

This is why we think open will win: when people can share what we know, knowledge grows faster and we can build so much more.

@riordan
Copy link
Contributor Author

riordan commented Feb 4, 2016

For reference here's the datasets we currently use:

@riordan
Copy link
Contributor Author

riordan commented Feb 4, 2016

Also here are a few open sources we're tracking now. No promises if/when they'll all make it into the service (and if it makes sense to route them through other, broader projects, like OpenAddresses or Who's on First), but we want to be transparent in what we've been thinking about.

@mtmail
Copy link

mtmail commented Feb 4, 2016

It's GB postal codes. The Northern Ireland postcodes are managed by https://www.dfpni.gov.uk/publications/lps-copyright-licensing-digital-application-forms (kind of a separate Ordinance Survey). Those are reselling Royalmail data and require a commercial license. I know, pretty annoying, especially since you can download the NI postcodes just fine.

For Canadian postcodes watch http://geocoder.ca/?sued=1#updates (and/or add to their legal defense fund)

@riordan
Copy link
Contributor Author

riordan commented Feb 4, 2016

Totally, thank you for catching the NI issue, @mtmail.

And we'll keep our eyes on geocoder.ca.

@orangejulius orangejulius changed the title DISCUSSION: What open datasets do you want in Pelias/Mapzen Search? DISCUSSION: What open datasets do you want in Pelias? Feb 8, 2018
@orangejulius
Copy link
Member

I would like to propose that we build an importer for All The Places. cc @iandees

@hughcameron
Copy link

Hi @orangejulius, I noticed your Docker project for Australia. The Australian Government also makes a Geocoded National Address File each quarter which may be useful to develop an importer for.

@trescube
Copy link
Contributor

trescube commented Oct 3, 2018

Hey @hughcameron, Pelias uses openaddresses which contains that curated Australia-wide source, kept up-to-date by the inimitable @andrewharvey. Thanks for your interest in Pelias, we love new data sources!

@orangejulius
Copy link
Member

@hughcameron, yup, what @trescube said. The excellent coverage via G-NAF data is part of the reason I made the Australia Docker project. We plan to use Australia as a mid-sized build for testing pelias going forward because we know there is great address coverage. If you have a chance to try it out let me know!

@hughcameron
Copy link

Hey @orangejulius & @trescube - thanks for letting me know. I got the Australia project up and running earlier in the week using forward geocoding for validation and parsing - really powerful stuff. Great work!

@yopaseopor
Copy link

yopaseopor commented May 11, 2019

I suggest two or three public sources:
Adresses. Directorate General for Cadastre. Spainhttp://www.catastro.minhap.es/webinspire/index.html
Institut Geològic i Cartogràfic de Catalunya. Name of streets and housenumbers http://www.icgc.cat/Descarregues/Llocs/WMS-Noms-i-numeracio-de-carrers
Barcelona city: all the adresses and housenumbers: https://opendata-ajuntament.barcelona.cat/data/ca/dataset/taula-direle

@Joxit
Copy link
Member

Joxit commented May 12, 2019

I think openaddresses will be happy to have this source : https://opendata-ajuntament.barcelona.cat/data/ca/dataset/taula-direle

@trescube
Copy link
Contributor

Feel free to open a PR!

@vincentfenet
Copy link

In france, you have the official https://adresse.data.gouv.fr/ too

@Joxit
Copy link
Member

Joxit commented Mar 3, 2020

adresse.data.gouv.fr is already in OpenAddresses thanks to a member of Etalab 😄

@hennihaus
Copy link

hennihaus commented Oct 15, 2020

It would be nice to have an official importer for gtfs-data.

@gelsas
Copy link

gelsas commented Apr 4, 2021

I noticed that the Canadian Postcodes are not really good in pelias yet.

This company is making monthly updated canadian zip code data (890.000 entries right now) avaiable for free: https://www.serviceobjects.com/blog/unique-us-canadian-zip-code-files-available-download/

This would make Canadian address data much more accurate.

Is the census data from here used for canada already?
https://www12.statcan.gc.ca/census-recensement/2011/geo/bound-limit/bound-limit-eng.cfm
https://www12.statcan.gc.ca/census-recensement/2011/geo/RNF-FRR/index-eng.cfm

@orangejulius
Copy link
Member

Hi @gelsas,
Thanks for the recommendation. Open data for Canadian postcodes is hard to come by. The copyright over the data is held by Canada Post and they have defended that copyright in the past, as our fellow geocodeur Ervin from geocode.xyx knows well.

It appears that the license for that zip file from ServiceObjects is effectively proprietary, and certainly not suitable for inclusion by default in Pelias. It's likely anyone using Pelias for their own needs wouldn't want to use it either.

@gelsas
Copy link

gelsas commented Dec 8, 2021

Is this included in pelias ?
Australia:
https://data.gov.au/dataset/geocoded-national-address-file-g-naf

(This is really important question for me, since I am considering setting this up: https://github.com/mountain-pass/addressr if the data is not yet in pelias)

@gelsas
Copy link

gelsas commented Dec 8, 2021

Is this data sets already used in pelias:
iceland:
https://github.com/sveinbjornt/iceaddr

Is NZ data included already?
https://data.linz.govt.nz/

@orangejulius
Copy link
Member

Hi @gelsas,

The G-NAF dataset from Australia, New Zealand's countrywide address dataset (and several others specific to particular cities or regions), and the Islandic address registry are all supported out of the box by Pelias through the wonderful OpenAddresses project.

The OpenAddresses folks are pretty diligent about adding new public, appropriately licensed address datasets, so they'll generally have anything like that. You can always check in their sources list and if you do happen to find one that's missing they would definitely want to know :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

10 participants