Improve graph performance

The performance for building the graph is currently bad in areas where there is high node density. The problem has many reasons, the most important one being that tiles are loaded with too many nodes, the nodes contain too much data and the tiles contains all the sequences for all the nodes in the tile. The sequences are retrieved for every tile, i.e. duplicates that are not needed are retrieved for adjacent tiles. The tiles can therefor be vary large which requires good network connection for fast retrieval. The aim of this issue is to completely rebuild the graph implementation to make it fast and dynamic.

To resolve the performance issue, the graph implementation needs to be completely rewritten. When doing this a number of other points should be addressed as well. The new implementation should be written in a way that supports the following:
- Uncaching of nodes (image, mesh, edges). ➕ _Separate issue, supported #25_
- Uncaching of tiles (public API on graph service to enable a cleaning component to track the graph and clean unused tiles). Uncaching a tile should flow down to cleaning of graph nodes, edges and node images and meshes. ➕ _Separate issue, supported #131_
- Uncaching of graph nodes. ➕ _Separate issue, supported #131_
- Uncaching should be based on geohash tiles as well as sequences ➕ _Separate issue, supported #131_ 
- Usage of Falcor API ✔️ 
- Different graph modes (spatial, sequence) ➕ _Separate issue, supported #67_
- Node filtering (separate issue but graph should be prepared for it) ➕ _Separate issue, supported #192_
- Reset edges of graph (required for filtering to only navigate to filtered nodes) ✔️ 
- Dynamic tile sizes and edge calculation thresholds to be able to scale the graph calculation depending on node density in area ➕ _Separate issue, supported #211_
- Node class should have an API for requesting a higher resolution image than the current one and emit the new image when finished ➕ _Separate issue, supported #212_
- Error handling and retry when API calls fails ✔️ 
- Unit and performance tests  ➕ _Separate issue, supported #213_
- Node should not have a worthy property, should be handled in graph ✔️ 
- Sequences of nodes should always be expected in edge calculation, a node sequence should never be null ✔️ 
- All methods and classes should be documented ✔️ 

The idea of the new graph implementation is for data to flow in the following way:

```
                key 
                 |
                 |
                 v
              imByKey (Core + Spatial + H) ------> Node
                / \                                 |
               /   \                                |
              v     v                               v
           imsByH  seqByKey           Image + Mesh + Core + Spatial
           (Core)   |                               |
              |     |                               |
              |     |                               v
              v     |                          Put on state 
        imsByKey    |                         (i.e. display)
  (bbox, Spatial)   |
              |   / |
              |  /  |
              | /   |
              v     v
   Spatial edges   Sequence edges
               \   /
                \ /
                 v
               Node
                / \                            
               /   \ 
              v     v
           Display arrows
```

In this way the node could potentially be put on the state once the node assets (image, mesh) have arrived but before the edges have been calculated. The lightweight tiles would not contain heavy node or sequence data and be fast to retrieve. The spatial edge calculation would rely on the `heavy` data but it would only be retrieved for exactly the image keys that are needed. The rbush spatial index will be used for determining what keys need to be turned in nodes by fetching the heavy node data. The sequence edge calculation would need to fetch the sequences for the nodes that requires edge calculation only. Falcor will be used to cache as well as batch load requests.

 The implementation of the new graph issue includes:
- [x] Data flow through observables from GraphService, Graph, Node and Edges
- [x] Falcor API usage for building graph
- [x] Ligthweight tile fetching
- [x] Heavy `imByKey` fetching for areas required for edge calculation
- [x] Two data streams for edge calculation (spatial, sequence)
- [x] Changed graph, node, state and navigation implementations.

This issue requires breaking changes to the public API, therefor the library version needs to be pushed to 2.0 when implemented.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve graph performance #191

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve graph performance #191

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions