How I got valhalla running #3540

timwis · 2022-02-18T08:01:12Z

timwis
Feb 18, 2022

Hello all, I love what I've read about valhalla in its docs, and I'd like to use it for an application I'm building to help people find mutually convenient locations to meet. Specifically, I'd like to use the isochrone generation service, primarily with public transport data, but also with other modes of travel. My initial focus will be London and the UK, but would eventually like to support more areas, and it looks like I can do all of that with valhalla.

But I've had some trouble getting it running. Some of the documentation is still a work in progress, and I ran into issues with some of the build scripts on my M1 macbook pro (when running in docker), e.g.:

$ valhalla_build_admins --config valhalla.json
qemu: uncaught target signal 11 (Segmentation fault) - core dumped

So I decided to fire up a digital ocean virtual private server (vps) to eliminate the M1 chip from the equation.

I decided to document everything so that (a) I could eventually reproduce it when it comes time to run it in production, and (b) in case anyone else comes across this repo and, like me, hasn't worked with C++ before and is struggling to get it running, and (c) to potentially surface areas where documentation could be improved (which I'm happy to help with).

It's more of a live coding exercise than a tutorial, but I hope it's helpful! Perhaps I'll adapt it into a tutorial if that would be useful.

setting up the server

I used a Digital Ocean virtual private server (vps) because they're cheap ($5 USD per month) and the interface is much simpler than the other cloud providers.

I could have used the documentation on running valhalla on linux, but I chose to use the docker image because I'd like to eventually run this in production, possibly on a PaaS, and perhaps even on my macbook during development, and the docker image is more portable.

So I installed docker and docker compose.

I then created a /valhalla directory with a /resources directory inside it. I created /valhalla/docker-compose.yml with these contents:

services:
  valhalla-run:
    image: valhalla/valhalla:run-3.1.4
    ports:
      - "8002:8002"
    volumes:
      - ./resources:/valhalla/resources

This makes it easier to run the valhalla 'run' docker image, persisting the artefacts it generates, and exposing the server's port to the host machine.

I then ran this image and attached my terminal to it via:

$ docker-compose run valhalla-run bash

This put me inside the container, where I navigated to /valhalla/resources, and ran all my commands from there.

preparing to run valhalla

I originally assumed that the valhalla docker image was "ready to go" and I could immediate generate isochrones with it. In fact, you must first download and prepare data for it to use, as well as a config to point to that data.

I use the 'running' section of the README as guidance for this.

generating the config

Running valhalla_build_config throws an error:

ModuleNotFoundError: No module named 'distutils.util'

So we need to install distutils

$ sudo apt-get update
$ sudo apt-get install python3-distutils python3-apt

That should probably be included in the docker image, though perhaps it's because we're using the 'run' image instead of the 'build' image.

Now we can generate the build config with default settings

$ valhalla_build_config > /valhalla/resources/valhalla.json

We'll need to modify some of the default settings in the build config. I used vim since I'm on a VPS, but you can use any text editor.

Update log paths to /valhalla/resources/{service-name}.log
Update other paths from /data/valhalla/ to /valhalla/resources/

Next we'll download OSM data for greater london into the /valhalla/resources directory

$ cd /valhalla/resources
$ wget http://download.geofabrik.de/europe/great-britain/england/greater-london-latest.osm.pbf

Now we'll follow the steps in the documentation's optional prerequisites section for mjolnir.

Administrative areas

From /valhalla/resources, we'll run:

$ valhalla_build_admins --config valhalla.json greater-london-latest.osm.pbf

This threw a handful of errors:

[ERROR] sqlite3_step() error: NOT NULL constraint failed: admin_access.admin_id.  Ignore if not using a planet extract or check if there was a name change for Australia

I think that's because we used greater london data instead of "planet" level data. So my guess is this step was probably somewhat useless with this data as there are no country crossings within greater london.

Timezones

This one was pretty simple:

$ valhalla_build_timezones > /valhalla/resources/tz_world.sqlite

Transit

First we'll make sure the target directory exists

$ mkdir -p /valhalla/resources/transit

Running valhalla_build_transit without any arguments shows the usage is:

Usage: valhalla_build_transit valhalla_config transit_land_url per_page [target_directory] [bounding_box] [transitland_api_key] [import_level] [feed_onestop_id] [onestop_test]
Sample: valhalla_build_transit conf/valhalla.json http://transit.land 1000 ./transit_tiles -122.469,37.502,-121.78,38.018 transitland-YOUR_KEY_SUFFIX 4 f-9q9-bart

The bounding box argument isn't clear whether it's in the order of latitude-longitude or longitude-latitude. According to the OpenStreetMap Wiki:

Latitude is a decimal number between -90.0 and 90.0.
Longitude is a decimal number between -180.0 and 180.0.

From this we can infer that it's longitude-latitude.

That wiki page also happens to have an example for greater london, which saves us time looking that up: -0.489,51.28,0.236,51.686

For the TransitLand API key, we'll need to sign up.

I'm not sure what import_level means, so we'll just use 4 from the sample.

To find Transport for London's Onestop ID, we can search London on TransitLand's operators index. It looks like the Onestop ID for TfL is o-gcpv-transportforlondon, but according to the Onestop ID explanation, the 'o' is for operator, and the valhalla_build_transit usage prompt calls for a feed_onestop_id, so we'll swap the 'o' for an 'f'.

So the full command ended up being:

$ valhalla_build_transit valhalla.json https://transit.land 1000 ./transit -0.489,51.28,0.236,51.686 MY_API_KEY 4 f-gcpv-transportforlondon

Unfortunately this threw an error:

[WARN] 500'd retrying https://transit.land/api/v1/feeds.geojson?per_page=false&api_key=MY_API_KEY&bbox=-0.489,51.28,0.236,51.686&active_feed_version_import_level=4

I copied the URL into my browser and saw a more descriptive error message:

{"message": "[NoMethodError] undefined method `onestop_id' for nil:NilClass"}

🤔 Hm. Now that I look again at the URL, I don't see the onestop_id I provided in there. Perhaps the cause for the error.

Maybe the usage sample on the valhalla_build_transit CLI is out of date. Let's look at TransitLand's API docs. (EDIT: I later realised valhalla uses an old version of the TransitLand API, v1)

There's a parameter called onestop_id. If we append &onestop_id=f-gcpv-transportforlondon to the URL in the browser, it works. Why isn't valhalla_build_transit doing that?

Let's search the valhalla codebase for feeds.geojson to see if we can find the code for constructing that URL.

If I'm understanding this correctly—and I don't read C++ well—it looks like the onestop_id is not added to the URL; instead, valhalla_build_config looks for the result that has a matching onestop_id.

So let's search valhalla's issues for any reference to this. It may also be a bug in transitland's api.

I don't see any reference to it in valhalla's issues. Poking around transitland's GitHub org, this repo looks like the one that powers the web service, which checks out because the error looks like a ruby error.

I searched transitland-datastore's issues for NoMethodError and saw some other issues, but not this one. So I searched its source code for onestop_id in the /app path, but there are 62 results, and without line or file information from the error, it would be very hard to narrow it down without running the whole app locally.

In the process of posting an issue, though, I realised I was looking at the wrong version of the transitland api docs. Valhalla's making requests to /v1/, which has its own documentation page. Sure enough, there's an example query there in the same format:

/api/v1/feeds?bbox=-122.4183,37.7758,-122.4120,37.7858

This suggests valhalla is making an appropriate query, and the issue is on the transitland side (albeit on an old and probably deprecated version of their API). I tried a few other variations of the URL to try and narrow down the issue, but they all yielded the same error message, so I posted an issue on the transitland-datastore repo and cross-posted on valhalla's repo for visibility.

I think for now we'll continue without transit data, and revisit when we hear back on those issues.

EDIT: The TransitLand team fixed this issue, and the valhalla_build_transit tool worked with the above command, but then quickly ran into a rate limiting issue. I think the way I'll get around this is by running transitland v1 locally, and replacing the TransitLand endpoint in the build command with my local server's endpoint.

Building tiles

I originally assumed tiles were only for basemaps, which I didn't need from valhalla, but it turns out they're used for routing, which is necessary for generating isochrones. So we'll need to build them.

I can see an example query in valhalla's readme, so we'll try that:

$ valhalla_build_tiles -c valhalla.json greater-london-latest.osm.pbf

This command generates a plethora of lines with green in them, and no red, which is always a great feeling.

After a minute or so, though, all I'm getting is this warning for about ten minutes:

[WARN] Exceeding maximum.  Average speed: 141

My guess is this has to do with the small amount of RAM that comes with digital ocean's tiniest VPS. Fortunately, a bunch of green lines came after that, and it finished 🎉

Building an extract

Valhalla's readme lists another command after this: valhalla_build_extract, which it says is to build a tile index for faster graph loading times. We'll run the command used in the example:

$ valhalla_build_extract -c valhalla.json -v

Unfortunately, this throws an error:

bash: valhalla_build_extract: command not found

Typing valhalla_ and pressing TAB shows me all the available valhalla scripts, and that isn't one of them.

I can see valhalla_build_extract is still in the repo, but for some reason it's not available in the docker 'run' image.

Looking at the run image's dockerfile, this may be deliberate. Perhaps we should have been using the build image this whole time, and perhaps that's why the python-distutils package was missing when we started.

Fortunately, we mounted the /resources directory when running the 'run' container, so if we stop the container and run a 'build' container with the same directory mounted, we shouldn't need to rebuild anything.

Let's add the 'build' image to our docker-compose.yml file, indented under services like the 'run' image:

  valhalla-build:
    image: valhalla/valhalla:build-3.1.4
    volumes:
      - ./resources:/valhalla/resources

(We'll omit the ports property since we can assume build isn't meant to be the one running a web server)

We'll run it using:

$ docker-compose run valhalla-build bash

And then we'll try that valhalla_build_extract command again.

$ cd /valhalla/resources
$ valhalla_build_extract -c valhalla.json -v

Unfortunately, this fails too, with the same command not found error. Pressing TAB after typing valhalla_ doesn't show any scripts this time.

I can see in the dockerfile for the build image it runs install-shared-deps.sh, so I tried running that manually, but that didn't fix the issue.

Looking at the Dockerfile and install-shared-deps.sh, neither appear to copy the valhalla source code (which includes the valhalla_build_extract script), which the 'run' Dockerfile does, so maybe we were right to use the 'run' container after all.

The valhalla readme does offer an alternative to the valhalla_build_extract command, of simply tarring up the tiles, so let's try that.

First, let's switch back to the 'run' image by pressing CTRL+C to detach from and stop the 'build' container, then running the 'run' container:

$ docker-compose run valhalla-run bash

Once we're in the container, navigate back to the /valhalla/resources directory and try the command from the readme, changing the name valhalla_tiles to tiles, to match the paths in our config file:

find tiles | sort -n | tar cf tiles.tar --no-recursion -T -

That's all the steps listed in the readme, so I think we're ready to try running the server!

Running the server

The docker-compose run command can't expose ports, so we'll need to exit the container (CTRL+C) and use docker-compose up. This will immediately stop the container, because the run container doesn't have a CMD in it (deliberately). So we'll need to add a command to our docker-compose.yml file:

  valhalla-run:
    image: valhalla/valhalla:run-3.1.4
    command: valhalla_service /valhalla/resources/valhalla.json 1
    ports:
    ...

I believe the 1 at the end is the number of cores. We'll run this service using:

$ docker-compose up -d valhalla-run

We get 1 success message and 3 warnings:

[INFO] Tile extract successfully loaded with tile count: 13
[WARN] (stat): /valhalla/resources/traffic.tar No such file or directory
[WARN] Traffic tile extract could not be loaded
[WARN] /valhalla/resources/elevation/ currently has no elevation tiles

These warnings make sense because we didn't load or process traffic or elevation data, but our config file still has the default paths in it, so valhalla looked for it.

Let's see if it works anyway. In another terminal tab, we'll SSH into our digital ocean vps again:

ssh root@VPS_IP_ADDRESS

Then we can try making a request to the server we're running via docker.

From the isochrone docs, we can see that the server runs on port 8002 (which is why we exposed that port in our docker-compose.yml), and it gives a sense of the expected URL structure, along with a sample request. Let's use curl to send that sample request:

curl localhost:8002/isochrone?json={"locations":[{"lat":40.744014,"lon":-73.990508}],"costing":"pedestrian","contours":[{"time":15,"color":"ff0000"}]}&id=Walk_From_Office

This returns an error:

{"error_code":100,"error":"Failed to parse json request","status_code":400,"status":"Bad Request"}curl: (3) bad range in URL position 49:
http://localhost:8002/isochrone?json=locations:[lon:-73.990508]

If you look closely, this is actually a curl related issue: the double quotes in the URL is likely the problem. curl probably expects them to be encoded (e.g. %22).

This article offers a very elegant solution, which we'll try:

$ curl -G --data-urlencode 'json={"locations":[{"lat":40.744014,"lon":-73.990508}],"costing":"pedestrian","contours":[{"time":15,"color":"ff0000"}]}&id=Walk_From_Office' localhost:8002/isochrone

This returns an error from valhalla:

{"error_code":171,"error":"No suitable edges near location","status_code":400,"status":"Bad Request"}

If we search valhalla's issues for that error message, we can see a comment from Kevin that it means either:

you dont have your data properly loaded so that no input locations are near anything because nothing is loaded
or your input is erroneous, did you switch lat and lon for example?

Now that I think about it, we only loaded in data for greater london, but we copied and pasted a query from the isochrone docs containing a lat/lon in New York.

Let's try it with a London lat/lon:

$ curl -G --data-urlencode 'json={"locations":[{"lat":51.5072,"lon":-0.1276}],"costing":"pedestrian","contours":[{"time":15,"color":"ff0000"}]}&id=Walk_From_Office' localhost:8002/isochrone

It worked!! 🎉 We got back a GeoJSON LineString object. And if we try to access the digital ocean server directly from my laptop's browser (rather than via SSH) on port 8002, it works! That means the server must have port 8002 open to the public internet by default.

nilsnolde · 2022-02-18T08:46:40Z

nilsnolde
Feb 18, 2022
Maintainer

Wow, congrats you pulled through all the way and thanks for sharing! It’s really nice to have someone sorting out the transitland stuff again, was always too lazy myself😅 Didn’t read it all but couple of comments:

have a look at our valhalla image, that might save some trouble (it likely lacks the transit executables but that’s a 2 secs Dockerfile fix to add them): https://github.com/gis-ops/docker-valhalla
your initial idea reminds of @kevinkreiser ‘s experimental /centroid endpoint (Note it’s undocumented): Introducing A Centroid/Converge/Rendezvous/Meet API #2734

0 replies

kevinkreiser · 2022-02-18T13:36:06Z

kevinkreiser
Feb 18, 2022
Maintainer

Was going to mention the centroid api as well for finding convenient places to meet.

Also seems like your found some warts, we will get those patched over. Thanks for reporting!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How I got valhalla running #3540

{{title}}

Replies: 2 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

How I got valhalla running #3540

timwis Feb 18, 2022

setting up the server

preparing to run valhalla

generating the config

Administrative areas

Timezones

Transit

Building tiles

Building an extract

Running the server

Replies: 2 comments

nilsnolde Feb 18, 2022 Maintainer

kevinkreiser Feb 18, 2022 Maintainer

timwis
Feb 18, 2022

nilsnolde
Feb 18, 2022
Maintainer

kevinkreiser
Feb 18, 2022
Maintainer