Implement eigenvector centrality #2287

ChuckHastings · 2022-05-19T03:05:53Z

This PR implements Eigenvector Centrality in C++ using the graph primitives. It also provides the C API implementation.

There are unit tests for both the C++ and C APIs, covering both SG (single-GPU) and MG (multi-GPU).

Partially addresses #2146

codecov-commenter · 2022-05-19T06:57:03Z

Codecov Report

Merging #2287 (081134f) into branch-22.06 (d9ec8f7) will decrease coverage by 0.13%.
The diff coverage is 80.00%.

❗ Current head 081134f differs from pull request most recent head f86f15e. Consider uploading reports for the commit f86f15e to get more accurate results

@@               Coverage Diff                @@
##           branch-22.06    #2287      +/-   ##
================================================
- Coverage         63.82%   63.69%   -0.14%     
================================================
  Files               100      100              
  Lines              4484     4481       -3     
================================================
- Hits               2862     2854       -8     
- Misses             1622     1627       +5

Impacted Files	Coverage Δ
python/cugraph/cugraph/sampling/node2vec.py	`81.81% <33.33%> (ø)`
python/cugraph/cugraph/gnn/graph_store.py	`80.00% <100.00%> (-2.61%)`	⬇️
python/cugraph/cugraph/utilities/utils.py	`73.79% <100.00%> (+0.86%)`	⬆️
...n/pylibcugraph/pylibcugraph/utilities/api_tools.py	`88.05% <0.00%> (-7.47%)`	⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d9ec8f7...f86f15e.

seunghwak · 2022-05-19T16:15:56Z

cpp/include/cugraph/algorithms.hpp

+void eigenvector_centrality(
+  raft::handle_t const& handle,
+  graph_view_t<vertex_t, edge_t, weight_t, true, multi_gpu> const& graph_view,
+  raft::device_span<weight_t> centralities,


Just for the sake of discussion: what do you think about passing raft::device_span<weight_t> centralities as an input argument versus returning an rmm::device_uvector<weight_t> holding the centrality values?

The former might be more natural when we're passing initial values, and it may let us reduce memory allocations (for example, when we are running PageRank with different personalization vectors, although with the rmm pool allocator the memory allocation overhead might be insignificant). The latter might be more functional.

I got the idea of using the span from looking at your new triangle_count implementation. The [in/out] of centralities is more consistent with what we have been doing. Our paradigm thus far has been to specify the output storage a priori if we can know it, and to allocate it dynamically if we can't know it.

What you are suggesting would be a paradigm shift for the API. I'm not opposed to changing the paradigm.

It seems to me the current paradigm has the following advantages:

Less memory allocation. The new strategy would require temporarily having an extra vector of length V.

The caller can use any memory allocator they choose to allocate the device memory.

The new paradigm would have the following advantages:

More functional in nature

More consistency (all algorithms would return results the same way, whether the size is predictable or not)

In the grand scheme of memory things, I'm not all that concerned over allocating an extra result array temporarily. It seems to me that the functional feel of the proposed paradigm is useful and consistency in how algorithms behave across the interface is always better.

In this case I can certainly change raft::device_span<weight_t> centralities to std::optional<raft::device_span<weight_t>> centralities to support an optional input, and make the return value rmm::device_uvector<weight_t>.
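For readers following the trade-off above, here is a minimal host-side sketch of the two calling conventions. It is not cuGraph code: `std::vector<double>` stands in for `rmm::device_uvector<weight_t>`, the hypothetical `span_view` struct stands in for `raft::device_span<weight_t>`, and both function names are made up for illustration. The computation is only a uniform fill, so the only thing that differs is who owns the result.

```cpp
#include <cassert>
#include <cstddef>
#include <optional>
#include <vector>

// Hypothetical stand-in for raft::device_span: a non-owning view of caller memory.
struct span_view {
  double* data;
  std::size_t size;
};

// Current paradigm: the caller allocates the output and passes it in ([in/out]).
void uniform_scores_in_place(span_view centralities)
{
  for (std::size_t i = 0; i < centralities.size; ++i) {
    centralities.data[i] = 1.0 / static_cast<double>(centralities.size);
  }
}

// Proposed paradigm: optional initial values in, owning result out.
// When initial values are supplied, the caller's buffer and the returned
// vector coexist, which is the "extra vector of length V" mentioned above.
std::vector<double> uniform_scores(std::size_t num_vertices,
                                   std::optional<span_view> initial = std::nullopt)
{
  if (initial) {
    assert(initial->size == num_vertices);
    return std::vector<double>(initial->data, initial->data + initial->size);
  }
  return std::vector<double>(num_vertices, 1.0 / static_cast<double>(num_vertices));
}
```

With the first form the caller writes `uniform_scores_in_place(span_view{buf.data(), buf.size()})` on a buffer it already owns; with the second it writes `auto result = uniform_scores(n);` and never sizes the output itself.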

seunghwak · 2022-05-19T16:16:09Z

cpp/include/cugraph/algorithms.hpp

+ * @param handle RAFT handle object to encapsulate resources (e.g. CUDA stream, communicator, and
+ * handles to various CUDA libraries) to run graph algorithms.
+ * @param graph_view Graph view object.
+ * @param centralities Device span where we should store the eigenvector centralities


Can we pass initial values?

I will add that support. Missed that.

seunghwak · 2022-05-19T16:24:24Z

cpp/src/centrality/eigenvector_centrality_impl.cuh

+#include <rmm/exec_policy.hpp>
+
+#include <thrust/fill.h>
+#include <thrust/for_each.h>


Is this necessary?

Probably not, copy/paste. I'll check all the headers.

Don't forget to delete this.

seunghwak · 2022-05-19T16:26:37Z

cpp/src/centrality/eigenvector_centrality_impl.cuh

+  thrust::fill(handle.get_thrust_policy(),
+               centralities.begin(),
+               centralities.end(),
+               weight_t{1.0} / static_cast<weight_t>(num_vertices));


NetworkX supports passing initial values (https://networkx.org/documentation/stable/reference/algorithms/generated/networkx.algorithms.centrality.eigenvector_centrality.html). Shouldn't we support the same? (We already support initial values for PageRank.)

Will add, missed that.
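As a rough illustration of what supporting initial values means, here is a small host-side power-iteration sketch. It is an assumption-laden stand-in, not the implementation in this PR: the function name, the adjacency-list input, and the `epsilon` / `max_iterations` parameters are all hypothetical. When no initial values are given it falls back to the same uniform 1 / num_vertices start as the `thrust::fill` above. It iterates with A + I rather than A (the two share eigenvectors) so that the sketch also converges on bipartite graphs.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <optional>
#include <utility>
#include <vector>

// Hypothetical host-side helper, not the cuGraph API.
// adjacency[v] lists the neighbors of v in an undirected, unweighted graph.
std::vector<double> eigenvector_centrality_sketch(
  std::vector<std::vector<std::size_t>> const& adjacency,
  std::optional<std::vector<double>> initial_values = std::nullopt,
  double epsilon                                    = 1e-10,
  std::size_t max_iterations                        = 1000)
{
  std::size_t const n = adjacency.size();

  // Use the caller's starting vector if provided, otherwise 1 / num_vertices everywhere.
  std::vector<double> x = initial_values
                            ? std::move(*initial_values)
                            : std::vector<double>(n, 1.0 / static_cast<double>(n));
  assert(x.size() == n);

  for (std::size_t iter = 0; iter < max_iterations; ++iter) {
    // next = (A + I) * x: start from a copy of x, then add neighbor contributions.
    std::vector<double> next = x;
    for (std::size_t v = 0; v < n; ++v) {
      for (std::size_t u : adjacency[v]) { next[v] += x[u]; }
    }

    // Normalize to unit L2 length.
    double norm = 0.0;
    for (double value : next) { norm += value * value; }
    norm = std::sqrt(norm);
    if (norm == 0.0) { return next; }  // all-zero start: nothing to iterate on

    // Accumulate the L1 change between successive iterates.
    double diff = 0.0;
    for (std::size_t v = 0; v < n; ++v) {
      next[v] /= norm;
      diff += std::abs(next[v] - x[v]);
    }

    x = std::move(next);
    if (diff < epsilon) { break; }
  }
  return x;
}
```

Calling it as `eigenvector_centrality_sketch(adjacency)` uses the uniform start, while `eigenvector_centrality_sketch(adjacency, std::vector<double>{...})` starts from caller-supplied values; on a connected graph both should settle on the same normalized vector as long as the start is not orthogonal to it.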

ChuckHastings · 2022-05-19T22:42:38Z

Pushed an update to address @seunghwak's comments.

ChuckHastings · 2022-05-20T19:22:39Z

@gpucibot merge

ChuckHastings added 2 commits May 18, 2022 23:01

add eigenvector centrality implementation

f438a37

Merge branch 'branch-22.06' into fea_implement_eigenvector_centrality

39e4126

ChuckHastings requested review from a team as code owners May 19, 2022 03:05

ChuckHastings self-assigned this May 19, 2022

ChuckHastings added the 3 - Ready for Review, improvement, and non-breaking labels May 19, 2022

ChuckHastings added this to the 22.06 milestone May 19, 2022

ChuckHastings requested a review from seunghwak May 19, 2022 03:24

seunghwak reviewed May 19, 2022

ChuckHastings added 2 commits May 19, 2022 18:38

Change eigenvector API to be more functional

9676970

update C API tests with new C++ API for eigenvector centrality

6a50ee2

fix clang-format issues

7ed2c4b

seunghwak approved these changes May 20, 2022

ChuckHastings added 2 commits May 20, 2022 11:08

fix clang-format issues

a9a2579

delete unnecessary include

f86f15e

rapids-bot bot merged commit 2e23132 into rapidsai:branch-22.06 May 20, 2022

ChuckHastings deleted the fea_implement_eigenvector_centrality branch August 4, 2022 18:26

This pull request was closed.
