
Memory usage increases linearly with server_threads #81

Closed
ktnr opened this issue Dec 8, 2022 · 14 comments

@ktnr

ktnr commented Dec 8, 2022

Follow up on #58 and #63. With server_threads=1, memory usage increases to 4GB on the first call and does not increase on subsequent calls (with exactly the same matrix request). Everything is fine. With server_threads=2, memory usage increases to 4GB on the first call and 8GB on the second call, but does not increase on the third call.

At first I had server_threads unset because I followed this tutorial, so it defaulted to nproc. This explains the increase in memory usage, at least in my case, as explained in #58 (comment).

Is this expected behavior? The data is only accessed by read operations, right?

@kevinkreiser

When using memory mapping, each thread gets access to the map, which will be reported as the amount of RAM the OS has cached multiplied by the number of open handles to the map. In this way memory usage can actually report above 100 percent, IIRC. Matrix, though, has some interesting dynamically allocated memory requirements which we may have neglected to trim back after each request. Do you get this or a similar behavior with route requests?
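Kevin's point about per-handle reporting can be checked on a Linux host: RSS counts shared, memory-mapped pages in full for every process that maps them, while PSS divides each shared page among its mappers. A rough sketch (inspecting the current shell by default is just for illustration; pass the server's PID to inspect Valhalla instead):

```shell
#!/bin/sh
# Compare RSS (shared mmap'd pages counted in full) with PSS (shared
# pages divided among all processes mapping them). Defaults to the
# current shell so it runs anywhere; pass the valhalla_service PID
# as the first argument, e.g. "$(pgrep -o valhalla_service)".
PID=${1:-$$}
grep -E '^(Rss|Pss):' "/proc/${PID}/smaps_rollup"
```

A large gap between Rss and Pss would point at the shared tile map being counted multiple times rather than at real per-thread allocations.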

@nilsnolde
Owner

Well, the funny thing is it happens only with our image, not with the native Valhalla one, with the same graph and the same requests. I don't understand it, or know how to approach it yet..

@ktnr
Author

ktnr commented Dec 8, 2022

@nilsnolde: I can't yet confirm that it's exactly the same behavior for the native Valhalla image, as I'm not sure it was set to (or defaulted to) the same server_threads count.

Will also test the behavior for route requests.

@ktnr
Author

ktnr commented Dec 9, 2022

Alright. Same memory usage behavior for the matrix and route endpoints when using gis-ops/valhalla and valhalla/valhalla with the same server_threads, even when executing the exact same request multiple times: linear increase with server_threads, capped by max RAM usage * server_threads.

For completeness, I'll attach the matrix request and valhalla config: valhalla-memory_increase.zip. Here's the route request

curl http://localhost:8002/route --data '{"locations":[{"lat":47.619904,"lon":12.902326},{"lat":51.717959,"lon":6.217763}],"costing":"auto"}'

and the compose file:

version: '3.0'
services:
  valhalla:
    image: gisops/valhalla:latest
    ports:
      - "8002:8002"
    volumes:
      - ${HOME}/downloads/map-data/valhalla/custom_files:/custom_files
    #mem_limit: 10g
    #cpus: 1
    environment:
      # The tile_file must be located in the `custom_files` folder.
      # The tile_file has priority and is used when valid.
      # If the tile_file doesn't exist, the url is used instead.
      # Don't blank out tile_url when you use tile_file and vice versa.
      - tile_urls=europe/germany-latest.osm.pbf
      - use_tiles_ignore_pbf=True
      - force_rebuild=False
      - force_rebuild_elevation=False
      - build_elevation=False
      - build_admins=True
      - build_time_zones=True
      - server_threads=2  # determines how many threads will be used to run the valhalla server
      
  valhalla-native:
    image: valhalla/valhalla:run-latest
    command:
      - /bin/bash
      - -c
      - |
        valhalla_service custom_files/valhalla.json 2 # The second argument specifies the number of `server_threads`
    ports:
      - 8002:8002
    volumes:
      - ${HOME}/downloads/map-data/valhalla/custom_files:/custom_files
    #mem_limit: 10g
    #cpus: 1
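For anyone reproducing this without the attached zip, a matrix request over the same two points can be sent to Valhalla's sources_to_targets (matrix) endpoint; the payload below is illustrative and may differ from the one in valhalla-memory_increase.zip:

```shell
#!/bin/sh
# Hypothetical matrix request mirroring the route request above;
# sources_to_targets is Valhalla's matrix endpoint. Host/port match
# the compose files above.
BODY='{"sources":[{"lat":47.619904,"lon":12.902326}],"targets":[{"lat":51.717959,"lon":6.217763}],"costing":"auto"}'
curl http://localhost:8002/sources_to_targets --data "$BODY"
```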

From what @kevinkreiser said above and in valhalla/valhalla#3405 (comment), I wouldn't expect the increase in memory.

Also referencing valhalla/valhalla#3556, as it might be relevant and may solve the issue of @elliveny. Note, limiting the number/share of cpus in the compose file does not set the memory cap, only limiting server_threads does.

@nilsnolde
Owner

nilsnolde commented Dec 9, 2022

Thanks, that's super helpful and also relieving.. I don't really understand all the implications of threading vs multi-processing with regards to mem mapping and tile cache(s). Will have a session with the others to fully understand myself, then write it down in some docs in the upstream repo.

Though it still might be that we're not resetting some stuff in the matrix code the way we should..

@ktnr
Author

ktnr commented Dec 9, 2022

Glad the info helped. It would be great if you could link the write-up here. Love the valhalla ecosystem btw, keep it up.

@nilsnolde
Owner

So finally I understand the operations stuff much better after a talk with @kevinkreiser . I'm sure we'll write it up at some point. It's quite involved though, so for anyone to really understand the internals, it'd have to be pretty detailed. What's possibly the least intuitive for newcomers/not-hardcore-programmers is that in most environments you'd want to work with the tar archive, which leaves the memory consumption mostly to the OS, not Valhalla. But at least our image uses the tar by default; I don't think we even keep the plain tiles directory.

The other place that needs considerable RAM is the routing algorithms while expanding, and the bidirectional matrix is by far the greediest. That happens per request: after the first request it keeps a considerable chunk of RAM allocated to avoid that penalty on the next request, though it does some (configurable) trimming. What Kevin was referring to: the matrix algo(s) might not trim their allocated memory enough after a request (even though a quick skim over the code looked fine, even too fine, it doesn't seem to keep any allocation..). So that's likely the place where we'd need to look.
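The trimming mentioned above is configurable in valhalla.json; if I read the defaults right, the relevant thor options are clear_reserved_memory and max_reserved_labels_count (a sketch worth double-checking against your Valhalla version's default config):

```json
{
  "thor": {
    "clear_reserved_memory": true,
    "max_reserved_labels_count": 1000000
  }
}
```

With clear_reserved_memory enabled, the expansion containers are released after each request, trading per-request allocation cost for a lower steady-state footprint.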

To better reason about your situation: can you share your Valhalla config JSON? Only in case it's not the default.

@ktnr
Author

ktnr commented Dec 20, 2022

Are you using standard memory allocators? The config is already uploaded in #81 (comment): valhalla-memory_increase.zip.

@nilsnolde
Owner

> Are you using standard memory allocators?

I'm not a CS master, but yeah, AFAICT it's the standard allocators coming with unordered_map & vectors.

@kk2491

kk2491 commented Jan 12, 2023

I am facing a similar issue. Can anybody please explain how we can fix it?

@nilsnolde
Owner

enquiry@gis-ops.com ☺️

@nilsnolde
Owner

So, turns out this is a feature, not a bug. Sorry to everyone, I also learned something here..

This was referenced Feb 23, 2023
@nilsnolde
Owner

There are problems with the matrix, see valhalla/valhalla#4064. No one really bothered to take a look back then, but @kevinkreiser had the right hunch here: #81 (comment)

@ktnr
Author

ktnr commented Apr 9, 2024

I am unsure whether I have understood the expected behavior correctly, especially after valhalla/valhalla#4064.

Seeing the newer comments in valhalla/valhalla#3556, it seems others are still experiencing similar problems or misunderstanding the expected behavior. In that issue, it is also mentioned that memory usage should be more efficient when using the tar files.

I reran the test described in this issue. I have built my tiles with the default options as described in the Readme, which includes build_tar = True and use_tiles_ignore_pbf = True by default.

With these settings, I still get the same behavior as described above, where memory usage increases linearly (with the exact same requests sent repeatedly) with the number of server_threads and is capped by ~osm-extract-size * server_threads. Since the tiles/tar is accessed in a read-only manner (I suppose), I would not expect to see the increase in RAM usage. To me, this suggests that each thread is independently mapping or allocating memory without sharing it effectively with the other threads.
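One way to probe that hypothesis on a Linux host is to count how often the tile archive shows up in the server process's memory map: threads share one address space, so the tar should appear once per open handle, and more than one entry would suggest each thread maps the file independently. A sketch (the tiles.tar name comes from the default setup; it inspects the current shell by default so it runs anywhere, where the count is expectedly 0):

```shell
#!/bin/sh
# Count mappings of the tile archive in a process's address space.
# More than one entry would indicate multiple independent mmap handles.
# Pass the valhalla_service PID as $1; defaults to the current shell.
PID=${1:-$$}
grep -c 'tiles\.tar' "/proc/${PID}/maps" || true
```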
