[REVIEW] Personalized Page Rank by kaatish · Pull Request #300 · rapidsai/cugraph

kaatish · 2019-05-24T18:14:38Z

No description provided.

afender

Personalized Page Rank looks great, surgically inserted in the exiting Pagerank code and nicely exposed with the Nx-like API 👍

I'd just recommend a couple of python testing additions to cover cyber use case and the newly added "guess" feature

python/cugraph.egg-info/PKG-INFO

afender · 2019-05-24T18:51:56Z

cpp/include/algorithms.h

+ * @Param[in] has_guess              This parameter is used to notify cuGRAPH if it should use a user-provided initial guess. False means the user doesn't have a guess, in this case cuGRAPH will use a uniform vector set to 1/V.
+ *                                   If the value is True, cuGRAPH will read the pagerank parameter and use this as an initial guess.
+ *                                   The initial guess must not be the vector of 0s. Any value other than 1 or 0 is treated as an invalid value.
+ * @Param[in] pagerank (optional)    Initial guess if has_guess=true


We should test the newly exposed guess support at the python level too

Discussed on slack. Plan :
Since we run cugraph after Nx we could provide Nx output as guess to cugraph and expect to converge in 1 or 2 iterations.

afender · 2019-05-24T19:03:00Z

python/cugraph/pagerank/test_pagerank.py

 MAX_ITERATIONS = [500]
 TOLERANCE = [1.0e-06]
 ALPHA = [0.85]
+PERSONALIZATION_PERC = [0, 10, 50]


I think we should add a comment explaining how these parameters impact the way the personalization is generated in networkx_call .

afender · 2019-05-24T19:18:46Z

python/cugraph/pagerank/test_pagerank.py

        raise TypeError('Shape is not square')

+    personalization = None
+    if personalization_perc != 0:


We should explain how the personalization vector is set. I'm not completely sure about what's in there in the end.

In cyber, some users don't need values. They just have a set of vertex they want to parametrize. We would need to add a test for that where val[i] = 1.0/num_personalized_vertices if i is parametrized and 0.0 otherwise.

Fixed here

That case is already tested in personalization=None

cpp/include/algorithms.h

afender · 2019-05-24T19:40:10Z

cpp/src/link_analysis/pagerank.cu

+  int prsLen = 0;
+  GDF_REQUIRE((personalization_subset == nullptr) == (personalization_values == nullptr), GDF_INVALID_API_CALL);
+  if (personalization_subset != nullptr) {
+    has_personalization = true;


What happens when personalization_subset != nullptr but its size is 0? Does it run the regular PageRank?

Good catch. Right now it will still try to run personalized page rank. This needs to be fixed to revert to normal pagerank.

afender · 2019-05-24T19:45:36Z

cpp/src/link_analysis/pagerank.cu

-  fill(n, b, randomProbability);
+  if (has_personalization) {
+    fill(n, b, static_cast<ValueType>(0));
+    scatter(prsLen, prsVal, b, prsVtx);


Should b sum to one from the mathematical perspective of the problem?
If so, what if the user's values don't? Should we normalize by nrm1 when it is larger or fill the rest of the vector with the correct uniform value when it is smaller? We could also return an error.

Discussed on slack with Aatish and Haekyu. We may normalize the input.

if |Q| == 1.0: continue else if Q == 0.0 <- uniform dist for all nodes else Q <- Q / |Q|

BradReesWork

Looks great. Just need to address Alex's comments

kaatish changed the title ~~Personalized Page Rank~~ [WIP] Personalized Page Rank May 24, 2019

kaatish changed the title ~~[WIP] Personalized Page Rank~~ [REVIEW] Personalized Page Rank May 24, 2019

afender self-requested a review May 24, 2019 18:38

afender reviewed May 24, 2019

View reviewed changes

cpp/include/algorithms.h Show resolved Hide resolved

afender reviewed May 24, 2019

View reviewed changes

afender added the 3 - Ready for Review label May 24, 2019

BradReesWork requested changes May 29, 2019

View reviewed changes

BradReesWork added the 4 - Waiting on Author label May 29, 2019

afender mentioned this pull request May 29, 2019

[gpuCI] Auto-merge branch-0.7 to branch-0.8 [skip ci] #290

Closed

Personalized Page Rank

b61476b

BradReesWork approved these changes May 31, 2019

View reviewed changes

BradReesWork merged commit 7c14d69 into rapidsai:branch-0.8 May 31, 2019

afender mentioned this pull request Jun 11, 2019

[FEA] NetworkX API matching - PageRank #212

Closed

1 task

Conversation

kaatish commented May 24, 2019

Uh oh!

afender left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

BradReesWork left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants