Modified Tarjan SCC to reduce memory usage. #413

saolof · 2021-04-16T23:38:09Z

Reduced heap memory usage, improved cache locality, and made the code somewhat cleaner for a possible future rewrite to an imperative implementation. The changes to a more modern version of Tarjan's algorithm were directly based on algorithm 3 in "A Space-Efficient Algorithm for Finding Strongly Connected Components" by David J. Pearce, while also aiming to avoid changing the structure of the code too much.

In addition to saving memory, this version has the property that it builds a lookup
table for the components at the same time as it outputs them. If v belongs to the kth output component, then
self.nodes[g.to_index(v)] == Some(k) . This could be used to save work in
functions that may want to call it such as condensation (where the work needed to make the node map can be completely avoided, while also creating a condensation that is topologically sorted).

Reduced heap memory usage to a single vec allocation, improved cache locality, and made the code cleaner for a possible future rewrite to a more optimized imperative version. In addition, the new version has the property that it builds a lookup table for the components. If v belongs to the kth output component, then self.nodes[g.to_index(v)] == Some(k) . This can be used to save work in functions that may want to call it such as condensation.

saolof · 2021-04-24T11:59:54Z

Another quick comment: because of the way it builds up a component lookup table, this version would also enable a sort_scc_stable function (returning a list of SCCs where the nodes appear in the order given by the original graph or some iterator over its nodes), which would be very useful in some situations.

Basically, since a node will be assigned to its proper SCC when visit returns from the top level loop, that means you can bucket sort nodes by their rindex value in the top level loop instead of when you pop them from the stack.

ABorgna

Really nice !

In addition to the comments, it would be nice to have some benchmarks to compare the changes.

src/algo/mod.rs

ABorgna · 2021-04-26T16:05:27Z

I ran #421's benchmarks, and I'm seeing some noticeable performance regressions (I include Kosaraju's for reference):

 name                        master ns/iter  new_tarjan ns/iter  diff ns/iter  diff %  speedup
 bigger_kosaraju_sccs        6,000           6,047                         47   0.78%   x 0.99
 bigger_tarjan_sccs          3,299           3,879                        580  17.58%   x 0.85
 full_kosaraju_sccs          1,047           1,047                          0   0.00%   x 1.00
 full_tarjan_sccs            538             658                          120  22.30%   x 0.82
 sccs_kosaraju_graph         2,687           2,607                        -80  -2.98%   x 1.03
 sccs_kosaraju_stable_graph  2,691           2,669                        -22  -0.82%   x 1.01
 sccs_tarjan_graph           1,809           2,134                        325  17.97%   x 0.85
 sccs_tarjan_stable_graph    1,755           2,087                        332  18.92%   x 0.84

I'll try to do some profiling.

Tests the visited flag before doing the recursive call.

ABorgna · 2021-04-26T19:13:18Z

I pushed a small change to your branch, it checks if the node has been visited before calling visit and hence avoids the recursive calls.
Now the benchmarks are much nicer :)

 name                        master ns/iter  new_tarjan_check ns/iter  diff ns/iter   diff %  speedup
 bigger_kosaraju_sccs        5,791           5,402                             -389   -6.72%   x 1.07
 bigger_tarjan_sccs          3,002           2,817                             -185   -6.16%   x 1.07
 full_kosaraju_sccs          1,156           976                               -180  -15.57%   x 1.18
 full_tarjan_sccs            487             438                                -49  -10.06%   x 1.11
 sccs_kosaraju_graph         2,405           2,367                              -38   -1.58%   x 1.02
 sccs_kosaraju_stable_graph  2,263           2,261                               -2   -0.09%   x 1.00
 sccs_tarjan_graph           1,460           1,249                             -211  -14.45%   x 1.17
 sccs_tarjan_stable_graph    1,478           1,223                             -255  -17.25%   x 1.21

ABorgna · 2021-04-26T19:31:13Z

Uhm, I'm getting pretty noisy benchmarks (full_kosaraju_sccs there got -15% with no code changes).
But the Tarjan implementations is consistently faster with your implementation.

If you are OK with this I think the PR is ready to merge.

saolof · 2021-04-26T20:12:52Z

Sounds great! When I have more time I'll work on an imperative version that can't stack overflow and microoptimize it in a followup PR. There's also a few other functions that depend on the SCC's such as condensation that could be rewritten to take advantage of the new node_component_index method in order to avoid work.

* Modified Tarjan SCC to reduce memory usage. Reduced heap memory usage to a single vec allocation, improved cache locality, and made the code cleaner for a possible future rewrite to a more optimized imperative version. In addition, the new version has the property that it builds a lookup table for the components. If v belongs to the kth output component, then self.nodes[g.to_index(v)] == Some(k) . This can be used to save work in functions that may want to call it such as condensation. * added std:: in front of std::usize::MAX * Rustfmt-ed the code. * Added node_component_index, adopted NonZeroUsize, reversed counters. * Minor change to node_component_index. * Docs update. Added citation to paper with link. * Improve tarjan scc performance Tests the visited flag before doing the recursive call. Co-authored-by: Agustin Borgna <agustinborgna@gmail.com>

saolof added 4 commits April 16, 2021 19:14

added std:: in front of std::usize::MAX

1bbde7d

Rustfmt-ed the code.

a9cba70

Merge branch 'master' of https://github.com/saolof/petgraph

ec6e0d5

ABorgna reviewed Apr 25, 2021

View reviewed changes

src/algo/mod.rs Show resolved Hide resolved

src/algo/mod.rs Outdated Show resolved Hide resolved

src/algo/mod.rs Outdated Show resolved Hide resolved

saolof added 3 commits April 26, 2021 07:22

Added node_component_index, adopted NonZeroUsize, reversed counters.

e3eceed

Minor change to node_component_index.

bd24e02

Docs update. Added citation to paper with link.

31ee7cf

ABorgna mentioned this pull request Apr 26, 2021

Add benchmarks for Tarjan's SCC algorithm #421

Merged

Improve tarjan scc performance

cace4c7

Tests the visited flag before doing the recursive call.

ABorgna merged commit 5e3a5b5 into petgraph:master Apr 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modified Tarjan SCC to reduce memory usage. #413

Modified Tarjan SCC to reduce memory usage. #413

saolof commented Apr 16, 2021 •

edited

saolof commented Apr 24, 2021

ABorgna left a comment

ABorgna commented Apr 26, 2021

ABorgna commented Apr 26, 2021

ABorgna commented Apr 26, 2021

saolof commented Apr 26, 2021 •

edited

Modified Tarjan SCC to reduce memory usage. #413

Modified Tarjan SCC to reduce memory usage. #413

Conversation

saolof commented Apr 16, 2021 • edited

saolof commented Apr 24, 2021

ABorgna left a comment

Choose a reason for hiding this comment

ABorgna commented Apr 26, 2021

ABorgna commented Apr 26, 2021

ABorgna commented Apr 26, 2021

saolof commented Apr 26, 2021 • edited

saolof commented Apr 16, 2021 •

edited

saolof commented Apr 26, 2021 •

edited