This repository has been archived by the owner on Apr 27, 2018. It is now read-only.

Represent link structure as graph using GraphX #201

Closed
jrwiebe opened this issue Feb 29, 2016 · 4 comments

Comments

@jrwiebe
Collaborator

jrwiebe commented Feb 29, 2016

Once we obtain a graph representation of our site link structure within Spark/Warcbase, we will be able to further simplify operations that currently depend on other tools (e.g. Gephi, for PageRank).

https://spark.apache.org/docs/latest/graphx-programming-guide.html#pagerank
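For illustration only, here is a minimal sketch of what this could look like. It assumes the links have already been extracted as (source URL, target URL) pairs; how warcbase would produce those pairs is not shown in this thread. The names `LinkGraphSketch`, `buildGraph`, and `rankUrls`, and the example URLs, are hypothetical. Only `Graph`, `Edge`, and `pageRank` come from GraphX.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.graphx.{Edge, Graph, VertexId}
import org.apache.spark.rdd.RDD

// Hypothetical sketch, not part of warcbase.
object LinkGraphSketch {

  // Build a GraphX graph from (sourceUrl, targetUrl) pairs.
  // GraphX needs Long vertex IDs, so each distinct URL is assigned one.
  def buildGraph(links: RDD[(String, String)]): Graph[String, Int] = {
    val vertices: RDD[(VertexId, String)] =
      links.flatMap { case (src, dst) => Seq(src, dst) }
        .distinct()
        .zipWithUniqueId()
        .map { case (url, id) => (id, url) }

    val idByUrl: RDD[(String, VertexId)] =
      vertices.map { case (id, url) => (url, id) }

    // Resolve both endpoints of every link to vertex IDs.
    // Repeated links become parallel edges; the edge attribute is unused here.
    val edges: RDD[Edge[Int]] =
      links.join(idByUrl)                                   // (src, (dst, srcId))
        .map { case (_, (dst, srcId)) => (dst, srcId) }
        .join(idByUrl)                                      // (dst, (srcId, dstId))
        .map { case (_, (srcId, dstId)) => Edge(srcId, dstId, 1) }

    Graph(vertices, edges)
  }

  // Run PageRank until convergence within `tol` and return URLs, highest rank first.
  def rankUrls(links: RDD[(String, String)], tol: Double = 0.0001): Array[(String, Double)] = {
    val graph = buildGraph(links)
    graph.pageRank(tol).vertices
      .join(graph.vertices)                                 // (id, (rank, url))
      .map { case (_, (rank, url)) => (url, rank) }
      .sortBy(_._2, ascending = false)
      .collect()
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("link-graph-sketch").setMaster("local[*]")
    val sc = new SparkContext(conf)

    // Made-up link pairs standing in for extracted site links.
    val links = sc.parallelize(Seq(
      ("a.example.org", "c.example.org"),
      ("b.example.org", "c.example.org"),
      ("c.example.org", "a.example.org")
    ))

    rankUrls(links).foreach { case (url, rank) => println(f"$rank%.4f  $url") }
    sc.stop()
  }
}
```

The sorted output of `rankUrls` is the kind of ordered list that a Gephi PageRank run produces, computed without leaving Spark.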

@ianmilligan1
Collaborator

This is great. Baking PageRank as per these docs into warcbase would be perfect, as a way to extract an ordered list of relevant resources.

@ianmilligan1
Collaborator

Probably worth merging with #183. The fork of @aliceranzhou's link-structure repo is at https://github.com/shamrt/link-structure. Are we getting close to being able to incorporate it into warcbase, either in this repo or in the docs?

@ianmilligan1
Collaborator

Just pinging again. How close is this branch to being ready to incorporate into main? (It would be nice to include it in the write-up of warcbase we're doing!)

@jrwiebe
Collaborator Author

jrwiebe commented Mar 17, 2016

I will take care of this shortly.


