-
Notifications
You must be signed in to change notification settings - Fork 28.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-3650] Fix TriangleCount handling of reverse edges #2495
Conversation
Can one of the admins verify this patch? |
ok to test |
QA tests have started for PR 2495 at commit
|
QA tests have finished for PR 2495 at commit
|
Test PASSed. |
What is the status on this patch? |
d054d33
to
0461ed0
Compare
Test build #23301 has finished for PR 2495 at commit
|
@ankurdave take a look at this when you get a chance. |
@rxin and @ankurdave what is the status on this? |
Jenkins, retest this please. |
Test build #25737 has finished for PR 2495 at commit
|
@jegonzal I fixed the compile error in jegonzal#3. |
Great! What else needs to be done? There was some discussion about how |
I think it just adds a bit of a slowdown for already-canonicalized graphs, which is probably fine considering how expensive triangle count is anyway. |
I'd like to close this issue pending further updates. |
@pwendell can this be opened again? As per my discussion on the the JIRA ticket this is an issue that came up on the mailing list recently. |
jegonzal ankurdave please could you review ## What changes were proposed in this pull request? Reworking of jegonzal PR #2495 to address the issue identified in SPARK-3650. Code amended to use the convertToCanonicalEdges method. ## How was the this patch tested? Patch was tested using the unit tests created in PR #2495 Author: Robin East <robin.east@xense.co.uk> Author: Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com> Closes #11290 from insidedctm/spark-3650.
This PR causes the TriangleCount algorithm to remove self-edges, direct edges from low-id to high-id (canonical direction), and then remove duplicate edges, before running the triangle count algorithm.