Cycle basis calculations #1957

szhorvat · 2022-02-10T11:55:22Z

Continuing from #1786

This PR implements some cycle basis calculations.

Fundamental cycle basis
Unweighed minimum cycle basis

It has a special feature where it can limit the length of cycles found, which drastically reduces computation time. This will be a unique feature of igraph.

Within the minimum cycle basis computation, I represent cycles as sorted vectors of edge IDs. This makes certain operations, such as "adding" cycles, efficient. However, in the end we obtain the cycles with an edge ID ordering that does not match the cycle structure. Therefore, in a final step we order the cycles. This operation was much easier with a map data structure, so I did it in C++. I could do it in pure C without writing a map from scratch, but it would be much more complicated, and I don't have the time now. There is an option that controls whether this reordering should be done. Setting it to false will speed the function up, though I did not yet time to benchmark by how much.

This PR also adds a igraph_vector_list_remove_consecutive_duplicates function.

codecov · 2022-02-10T11:59:40Z

Codecov Report

Merging #1957 (6508af9) into develop (1d2ab9e) will increase coverage by 0.32%.
The diff coverage is 97.94%.

@@             Coverage Diff             @@
##           develop    #1957      +/-   ##
===========================================
+ Coverage    75.78%   76.11%   +0.32%     
===========================================
  Files          350      352       +2     
  Lines        57486    57749     +263     
===========================================
+ Hits         43567    43956     +389     
+ Misses       13919    13793     -126

Impacted Files	Coverage Δ
src/misc/order_cycle.cpp	`97.22% <97.22%> (ø)`
src/misc/cycle_bases.c	`97.95% <97.95%> (ø)`
src/core/typed_list.pmt	`96.38% <100.00%> (+0.16%)`	⬆️
src/constructors/lcf.c	`93.61% <0.00%> (-4.06%)`	⬇️
src/core/progress.c	`66.66% <0.00%> (-3.93%)`	⬇️
src/io/gml-tree.c	`68.03% <0.00%> (-0.23%)`	⬇️
src/linalg/arpack.c	`73.94% <0.00%> (-0.07%)`	⬇️
src/constructors/basic_constructors.c	`100.00% <0.00%> (ø)`
src/core/vector.pmt	`88.00% <0.00%> (+0.01%)`	⬆️
src/io/gml.c	`79.60% <0.00%> (+0.04%)`	⬆️
... and 8 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1d2ab9e...6508af9. Read the comment docs.

and use it for cycle basis calculations

…culation

…ycle order

szhorvat · 2022-03-20T09:52:00Z

Please DO NOT MERGE, just give feedback. I will merge myself later. At first, I expect technical comments, as understanding the algorithm would take time. I discussed the algorithm with @vtraag last year, but I am not sure if he remembers.

include/igraph.h

include/igraph_cycles.h

src/core/typed_list.pmt

src/misc/cycle_bases.c

… docs

…longer acceptss NULL

szhorvat · 2022-03-22T11:25:52Z

Where feedback might be useful is an efficient algorithm to put edges in cycle order.

Currently, the minimum_cycle_basis() function produces cycles as sorted lists of edge IDs. This is actually sufficient for a surprisingly large number of applications, but in many cases users will expect the edge IDs to be ordered along the cycle instead. To do this, I used std::map from C++, hence the separate C++ source file. I'm not very happy with this, but it was the fastest way to handle this issue.

How the algorithm works is that it produces BFS-based fundamental cycles starting from all vertices (of degree d >= 3). This produces a list of candidate cycles of length (E - V + 1)*V, where E and V are edge and vertex counts. I.e., it produces a pretty huge list. These cycles are originally in the correct order, but they are sorted by edge ID for further processing. What we would do is to duplicate the original list in memory, and keep handles to the unsorted versions of cycles from the sorted one. However, this would complicate the code quite a bit (couldn't use vector_list anymore), and would more than double the already significant memory requirements. So I don't like it. This is why I opted to restore the cycle order on the final result instead.

szhorvat · 2022-03-22T11:31:10Z

Yet another thing where I need feedback is a refactoring for IGRAPH_HANDLE_EXCEPTIONS(...). Now the argument is a large block of code, but this causes problems. Notice the separate typedef std::map<igraph_integer_t, eid_pair_t> inclist_t; I have in that file. Why do I need that? Because writing map<igraph_integer_t, eid_pair_t> within IGRAPH_HANDLE_EXCEPTIONS(...) confuses the C preprocessor with that comma, and just won't work.

We probably want a begin/end style solution for IGRAPH_HANDLE_EXCEPTIONS, but I want to see concrete suggestions please.

…get rid of the inclist_t typedef

ntamas · 2022-03-23T10:50:01Z

Added a begin/end style solution as the first iteration. I think it's good enough. Another alternative would have been to move the function body to a separate function and just wrap a call to that function within IGRAPH_HANDLE_EXCEPTIONS().

szhorvat · 2022-03-23T10:51:21Z

src/core/exceptions.h

    catch (const std::bad_alloc &e) { IGRAPH_ERROR(e.what(), IGRAPH_ENOMEM); /* LCOV_EXCL_LINE */ } \
    catch (const std::exception &e) { IGRAPH_ERROR(e.what(), IGRAPH_FAILURE); } \
    catch (...) { IGRAPH_ERROR("Unknown exception caught.", IGRAPH_FAILURE); }

+#define IGRAPH_HANDLE_EXCEPTIONS(code) \
+    IGRAPH_HANDLE_EXCEPTIONS_BEGIN; \


This ; is a bit strange here as it expands to try {;.

This is in fact why I was reluctant to do this. Maybe we can use the following pattern?

IGRAPH_HANDLE_EXCEPTIONS_BEGIN { } IGRAPH_HANDLE_EXCEPTIONS_END;

??

A bit like

do { } while (cond);

Not sure what's best.

feel free to remove the ;, it has no effect. I just wanted to test that one is allowed to use IGRAPH_HANDLE_EXCEPTIONS_BEGIN or _END with or without a semicolon at the end without breaking thigs.

ntamas · 2022-03-23T10:51:56Z

Where feedback might be useful is an efficient algorithm to put edges in cycle order.

Do you mean that the input of the algorithm is a graph and a list of edge IDs in arbitrary order that are guaranteed to form a cycle, and the result should be a list where the edge IDs are in the order they are traversed along the cycle? I don't see any particular problem with the current implementation.

szhorvat · 2022-03-23T11:13:19Z

Let me clarify.

Here's an example of a graph with edges labelled by ID:

3, 4, 8, 5 form a cycle, in this order. However, the (internal) computation returns the edge ID in sorted order, 3, 4, 5, 8. I use a separate function, igraph_i_order_cycle(), to put them back in cycle order, i.e. 3, 4, 8, 5. This function is written in C++, not C, because it uses std::map.

I found this a bit ugly.

szhorvat changed the base branch from master to develop February 10, 2022 11:55

szhorvat mentioned this pull request Feb 10, 2022

Cycle basis calculations #1786

Closed

szhorvat force-pushed the develop branch from 627b30d to e8586bc Compare March 5, 2022 20:55

szhorvat force-pushed the feature/cycle-bases-develop branch 2 times, most recently from 30db2f1 to fbbce79 Compare March 19, 2022 20:17

szhorvat added 11 commits March 20, 2022 08:32

add fundamental cycles and minimum cycle basis

99e4374

bugfixes

97675ec

comments, docs, 'complete' arg

843b764

comments, fix FINALLY_CLEAN

b5fbb42

fix: free/destroy elements of output variable

07966bc

adapt to vector_lex_cmp bugfix

5f727e6

refactor: update for integer transition and new error type

20228d9

tests: add expected output for cycle basis test

123ef44

refactor: update cycle basiss functions to use vector_int_list

aaba598

refactor: add vector_list_remove_consecutive_duplicates()

3acd400

and use it for cycle basis calculations

refactor: completely eliminate use of vector_ptr from cycle basis cal…

4f4764c

…culation

szhorvat force-pushed the feature/cycle-bases-develop branch from d710e9f to 4f4764c Compare March 20, 2022 07:32

szhorvat added 3 commits March 20, 2022 10:36

feat: allow for ordering each minimum weight cycle basis element in c…

adf5002

…ycle order

fix: extra FINALLY_CLEAN in fundamental cycle basis calculation

3794f71

tests: tests for fundamental cycle basis

58956da

szhorvat marked this pull request as ready for review March 20, 2022 09:51

szhorvat requested review from ntamas and vtraag March 20, 2022 09:51

tests: update expected output

e5eecad

ntamas reviewed Mar 21, 2022

View reviewed changes

szhorvat added 3 commits March 21, 2022 15:33

refactor: remove extra newline from igraph.h header

30dd347

refactor: improve vector_list_remove_consecutive_duplicates() and its…

5e9f6e0

… docs

cycle bases: comments, checks and remove_consecutive_duplicates() no …

e10c390

…longer acceptss NULL

szhorvat added 2 commits March 21, 2022 16:04

fix: check for overflow in cycle basis calculations

7b44078

docs: mark cycle basis functions as experimental

9fd0f01

refactor: use IGRAPH_HANDLE_EXCEPTIONS_BEGIN and _END to allow us to …

bc64568

…get rid of the inclist_t typedef

szhorvat commented Mar 23, 2022

View reviewed changes

szhorvat added 3 commits March 25, 2022 16:07

refactor: change argument order for cycle basis functions

10a6512

refactor: rename cutoff to bfs_cutoff in cycle basis calculations

38b91a8

feat: add interface for cycle basis functions

6508af9

szhorvat force-pushed the feature/cycle-bases-develop branch from cdb3b24 to 6508af9 Compare March 25, 2022 15:22

szhorvat requested a review from ntamas March 25, 2022 15:23

ntamas approved these changes Mar 25, 2022

View reviewed changes

szhorvat merged commit 4055643 into igraph:develop Mar 26, 2022

This was referenced Apr 3, 2022

Wishlist: Cycle basis of undirected graph #1252

Closed

Weighted minimum cycle basis #2019

Open

GenieTim mentioned this pull request Aug 25, 2022

Find all simple cycles #1398

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cycle basis calculations #1957

Cycle basis calculations #1957

szhorvat commented Feb 10, 2022 •

edited

Loading

codecov bot commented Feb 10, 2022 •

edited

Loading

szhorvat commented Mar 20, 2022 •

edited

Loading

szhorvat commented Mar 22, 2022

szhorvat commented Mar 22, 2022

ntamas commented Mar 23, 2022

szhorvat Mar 23, 2022 •

edited

Loading

ntamas Mar 23, 2022

ntamas commented Mar 23, 2022

szhorvat commented Mar 23, 2022 •

edited

Loading

Cycle basis calculations #1957

Cycle basis calculations #1957

Conversation

szhorvat commented Feb 10, 2022 • edited Loading

codecov bot commented Feb 10, 2022 • edited Loading

Codecov Report

szhorvat commented Mar 20, 2022 • edited Loading

szhorvat commented Mar 22, 2022

szhorvat commented Mar 22, 2022

ntamas commented Mar 23, 2022

szhorvat Mar 23, 2022 • edited Loading

Choose a reason for hiding this comment

ntamas Mar 23, 2022

Choose a reason for hiding this comment

ntamas commented Mar 23, 2022

szhorvat commented Mar 23, 2022 • edited Loading

szhorvat commented Feb 10, 2022 •

edited

Loading

codecov bot commented Feb 10, 2022 •

edited

Loading

szhorvat commented Mar 20, 2022 •

edited

Loading

szhorvat Mar 23, 2022 •

edited

Loading

szhorvat commented Mar 23, 2022 •

edited

Loading