feat: igraph_joint_degree_distribution #2422

szhorvat · 2023-10-30T16:43:26Z

Docs and implementation are complete, but no tests yet.

I'd like to have #2419 merged before adding tests and finalizing the cross referencing of docs.

The following is a nice visual illustration of why we need the joint degree distribution (and not joint degree matrix) for statistical applications dealing with degree correlations.

Join degree matrix of a large undirected Erdős-Rényi graph:

Joint degree distribution of the same:

codecov · 2023-10-30T16:49:09Z

Codecov Report

Merging #2422 (af0ebc8) into master (e6b49f4) will increase coverage by 0.04%.
Report is 1 commits behind head on master.
The diff coverage is 86.66%.

@@            Coverage Diff             @@
##           master    #2422      +/-   ##
==========================================
+ Coverage   83.68%   83.73%   +0.04%     
==========================================
  Files         377      377              
  Lines       62137    62401     +264     
==========================================
+ Hits        52001    52252     +251     
- Misses      10136    10149      +13

Files	Coverage Δ
src/properties/degrees.c	`90.45% <25.00%> (+0.07%)`	⬆️
src/misc/mixing.c	`93.38% <89.10%> (-2.53%)`	⬇️

... and 6 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e6b49f4...af0ebc8. Read the comment docs.

szhorvat · 2023-10-30T18:42:47Z

Here's a thought: Perhaps we should generalize this to work with arbitrary integer "vertex types", which can be used in place of the degrees? Then supplying in- or out-degrees would be just one choice.

szhorvat · 2023-10-30T20:08:47Z

Here's a thought: Perhaps we should generalize this to work with arbitrary integer "vertex types", which can be used in place of the degrees? Then supplying in- or out-degrees would be just one choice.

This is now implemented as igraph_mixing_matrix(). I believe this name is used in the social sciences, but perhaps it is not the best fit here, given that it is a counterpart of igraph_joint_degree_distribution() and not igraph_joint_degree_matrix(). I would rather go with igraph_joint_type_distribution(). Calling it "joint distribution" encodes the fact that it is over ordered pairs.

@mbojan, perhaps you have some input on this function and its naming?

mbojan · 2023-11-02T23:21:08Z

@szhorvat , I didn't have time to study what you did in detail but if, in principle, you take an edge list (two-column matrix), replace node ids with corresponding values of some node attribute[s] (perhaps distinct attributes for source and target nodes in a digraph) and finally aggregate it by counting the unique combinations of values then indeed it is the contact layer of the mixing matrix. Thus function name seems appropriate.

src/misc/mixing.c

ntamas · 2023-11-03T09:10:23Z

src/misc/mixing.c

+ * \param graph The input graph.
+ * \param m The mixing matrix will be stored here.
+ * \param from_types Vertex types for source vertices. These must be non-negative integers.
+ * \param to_types Vertex types for target vertices. These must be non-negative integers.


Does it make sense to have to_types for undirected graphs? I would assume it does as each undirected edge is counted in "both" directions and the types of the vertices can then differ in both configurations, but I wonder whether it would really make a difference.

Good question. The matrix would be different, since the row and column counts may not be the same anymore. Is it a useful matrix? I am not sure. I would leave it as-is.

Approaches to MM of undirected graphs vary. In https://mbojan.github.io/netseg/reference/mixingm.html I count each edge once (so the sum of contact layer of MM is a proper edge count) and fold all counts to upper triangle so that lower triangle is all 0s. On the other hand https://github.com/statnet/network/blob/5fb95d2feddfd2a0fa3f234a102fe2992203c7c7/R/misc.R#L120 returns always a symmetric matrix, but then all the between-group ties are counted twice.

szhorvat · 2023-11-06T16:52:36Z

I renamed the second function to igraph_joint_type_distribution() for consistency with joint_degree_distribution(), but left the term "mixing matrix" in the short description for the sake of searchability in the index.

Test for igraph_joint_degree_distribution() is added. One more piece is missing, a test for igraph_joint_type_distribution(). But that function is almost certainly fine, since it shares an implementation with igraph_joint_degree_distribution().

szhorvat · 2023-11-06T18:22:13Z

@ntamas This is now ready for review (and hopefully merging, assuming tests pass). I'd like this in the next release.

szhorvat · 2023-11-06T18:45:18Z

There's a small bug in the one feature I didn't test ...

…bution()

szhorvat · 2023-11-06T19:12:13Z

It's good now.

…ector, joint_degree_distribution, joint_type_distribution

ntamas · 2023-11-07T21:16:01Z

Thanks a lot!

szhorvat · 2023-11-07T23:04:51Z

@ntamas Thanks! AFAIC we're good to release 0.10.8. The one thing I'd do beforehand is check on why Travis is not working, and if easy, run the Travis tests once (it's exotic hardware and old compilers, so it's good to run a test before release).

szhorvat mentioned this pull request Oct 30, 2023

feat: igraph_joint_degree_distribution #2420

Closed

szhorvat requested a review from ntamas October 30, 2023 16:44

szhorvat force-pushed the feat/joint-degree-distribution branch from c86cb51 to 88b73a1 Compare October 30, 2023 16:45

szhorvat force-pushed the feat/joint-degree-distribution branch 2 times, most recently from a5ec8fc to 7c49d0c Compare October 30, 2023 17:11

szhorvat force-pushed the feat/joint-degree-distribution branch 3 times, most recently from 138d9eb to 704a4a7 Compare October 30, 2023 21:47

ntamas reviewed Nov 3, 2023

View reviewed changes

szhorvat force-pushed the feat/joint-degree-distribution branch 5 times, most recently from 1dbd300 to 6f61bc7 Compare November 6, 2023 16:46

szhorvat force-pushed the feat/joint-degree-distribution branch from f9dd062 to 2509e1d Compare November 6, 2023 18:20

szhorvat marked this pull request as ready for review November 6, 2023 18:21

szhorvat force-pushed the feat/joint-degree-distribution branch from 2509e1d to 2b8ee54 Compare November 6, 2023 18:24

szhorvat marked this pull request as draft November 6, 2023 18:45

szhorvat added 4 commits November 6, 2023 19:01

feat: igraph_joint_degree_distribution() and igraph_joint_type_distri…

d7d54bc

…bution()

tests: igraph_joint_degree_distribution()

082edc3

docs: more cross-referencing for degree corrrelation functions

260d0dd

chore: update changelog

5396b00

szhorvat force-pushed the feat/joint-degree-distribution branch from 2b8ee54 to 5396b00 Compare November 6, 2023 19:01

szhorvat marked this pull request as ready for review November 6, 2023 19:12

fix: add missing weight vector length checks for degree_correlation_v…

af0ebc8

…ector, joint_degree_distribution, joint_type_distribution

ntamas approved these changes Nov 7, 2023

View reviewed changes

ntamas merged commit d3ba879 into master Nov 7, 2023
34 checks passed

ntamas deleted the feat/joint-degree-distribution branch November 7, 2023 21:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: igraph_joint_degree_distribution #2422

feat: igraph_joint_degree_distribution #2422

szhorvat commented Oct 30, 2023 •

edited

codecov bot commented Oct 30, 2023 •

edited

szhorvat commented Oct 30, 2023

szhorvat commented Oct 30, 2023 •

edited

mbojan commented Nov 2, 2023 •

edited

ntamas Nov 3, 2023

szhorvat Nov 6, 2023

mbojan Nov 8, 2023 •

edited

szhorvat commented Nov 6, 2023

szhorvat commented Nov 6, 2023 •

edited

szhorvat commented Nov 6, 2023

szhorvat commented Nov 6, 2023

ntamas commented Nov 7, 2023

szhorvat commented Nov 7, 2023

feat: igraph_joint_degree_distribution #2422

feat: igraph_joint_degree_distribution #2422

Conversation

szhorvat commented Oct 30, 2023 • edited

codecov bot commented Oct 30, 2023 • edited

Codecov Report

szhorvat commented Oct 30, 2023

szhorvat commented Oct 30, 2023 • edited

mbojan commented Nov 2, 2023 • edited

ntamas Nov 3, 2023

Choose a reason for hiding this comment

szhorvat Nov 6, 2023

Choose a reason for hiding this comment

mbojan Nov 8, 2023 • edited

Choose a reason for hiding this comment

szhorvat commented Nov 6, 2023

szhorvat commented Nov 6, 2023 • edited

szhorvat commented Nov 6, 2023

szhorvat commented Nov 6, 2023

ntamas commented Nov 7, 2023

szhorvat commented Nov 7, 2023

szhorvat commented Oct 30, 2023 •

edited

codecov bot commented Oct 30, 2023 •

edited

szhorvat commented Oct 30, 2023 •

edited

mbojan commented Nov 2, 2023 •

edited

mbojan Nov 8, 2023 •

edited

szhorvat commented Nov 6, 2023 •

edited