GiraphAnalysis

##installation

Graph analysis using the Apache Giraph incubator project.

To compile:  
mvn package

Running the analysis requires a compiled trunk version of Giraph:

git clone git@github.com:apache/giraph.git  
cd giraph  
mvn install

##Analysis

###pagerank (using an edgelist of strings as input)

hadoop jar target/Analysis-0.0.1-SNAPSHOT-jar-with-dependencies.jar \  
org.apache.giraph.GiraphRunner org.data2semantics.giraph.pagerank.PageRankComputation \  
-eif org.data2semantics.giraph.io.EdgeListStringReader \  
-of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \  
-mc org.data2semantics.giraph.pagerank.RandomWalkVertexMasterCompute \  
-wc org.data2semantics.giraph.pagerank.RandomWalkWorkerContext \  
-op <outputPath> \  
-eip <input edge list> \  
-w <number of workers>

This runs PageRank on the input file <input edge list>, writes the output to the directory <outputPath>, and uses <number of workers> workers.

###pagerank (using an edgelist of longs as input)

hadoop jar target/Analysis-0.0.1-SNAPSHOT-jar-with-dependencies.jar \  
org.apache.giraph.GiraphRunner org.data2semantics.giraph.pagerank.numerical.PageRankComputation \  
-eif org.data2semantics.giraph.io.EdgeListLongReader \  
-of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \  
-mc org.data2semantics.giraph.pagerank.numerical.RandomWalkVertexMasterCompute \  
-wc org.data2semantics.giraph.pagerank.numerical.RandomWalkWorkerContext \  
-op <outputPath> \  
-eip <input edge list> \  
-w <number of workers>

This runs PageRank on the input file <input edge list>, writes the output to the directory <outputPath>, and uses <number of workers> workers.
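For a quick sanity check of the job output on a small graph, it can help to have a single-machine reference. The following is a minimal Python sketch of power-iteration PageRank over an edge list. It is not the code in this repository: the input format (one whitespace-separated `source target` pair per line), the damping factor of 0.85, the fixed iteration count, and the uniform handling of vertices without out-edges are all assumptions, and the random-walk classes used above may behave differently.

```python
from collections import defaultdict


def parse_edge_list(text):
    """Parse an edge list. Assumed format: one 'source target' pair per line."""
    return [tuple(line.split()[:2]) for line in text.splitlines() if line.strip()]


def pagerank(edges, damping=0.85, iterations=30):
    """Power-iteration PageRank over (source, target) pairs of vertex IDs."""
    out_links = defaultdict(list)
    vertices = set()
    for source, target in edges:
        out_links[source].append(target)
        vertices.update((source, target))
    if not vertices:
        return {}

    n = len(vertices)
    rank = {v: 1.0 / n for v in vertices}
    for _ in range(iterations):
        # Rank held by vertices without out-edges is spread over all vertices
        # (an assumption of this sketch).
        dangling = sum(rank[v] for v in vertices if v not in out_links)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {v: base for v in vertices}
        # Each vertex passes an equal share of its rank along every out-edge.
        for source, targets in out_links.items():
            share = damping * rank[source] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


edges = parse_edge_list("a b\nb c\nc a\nd a\n")
ranks = pagerank(edges)
for vertex in sorted(ranks):
    print(f"{vertex}\t{ranks[vertex]:.4f}")
```

String IDs are used here, matching the string edge list variant; the same sketch works with integer IDs if the parsed fields are converted with `int`.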

###outdegree

hadoop jar target/Analysis-0.0.1-SNAPSHOT-jar-with-dependencies.jar \  
org.apache.giraph.GiraphRunner  org.data2semantics.giraph.pagerank.SimpleOutDegreeCountComputation \  
-eif org.data2semantics.giraph.io.EdgeListReader  \  
-of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \  
-op <outputPath> \  
-eip <input edge list> \  
-w <number of workers>

This computes the out-degree of every vertex in the input file <input edge list>, writes the output to the directory <outputPath>, and uses <number of workers> workers.
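Every invocation in this README has the same shape: the job jar, `org.apache.giraph.GiraphRunner`, a computation class, and then the option flags. A minimal Python sketch that assembles such a command as an argument list is shown here. The helper name `giraph_command` and the example paths `edges.txt` and `out` are hypothetical; only the jar path, class names, and flags are taken from the commands in this README.

```python
import shlex

JAR = "target/Analysis-0.0.1-SNAPSHOT-jar-with-dependencies.jar"
RUNNER = "org.apache.giraph.GiraphRunner"


def giraph_command(computation, edge_input_format, output_format,
                   input_path, output_path, workers, extra=()):
    """Build the hadoop argument list for one analysis job (hypothetical helper)."""
    args = ["hadoop", "jar", JAR, RUNNER, computation,
            "-eif", edge_input_format,
            "-of", output_format]
    # Optional flags such as -mc / -wc for the PageRank jobs go here.
    args += list(extra)
    args += ["-op", output_path, "-eip", input_path, "-w", str(workers)]
    return args


cmd = giraph_command(
    "org.data2semantics.giraph.pagerank.SimpleOutDegreeCountComputation",
    "org.data2semantics.giraph.io.EdgeListReader",
    "org.apache.giraph.io.formats.IdWithValueTextOutputFormat",
    input_path="edges.txt",   # placeholder input edge list
    output_path="out",        # placeholder output directory
    workers=4,
)
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`, which avoids shell quoting and line-continuation problems.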

###indegree

hadoop jar target/Analysis-0.0.1-SNAPSHOT-jar-with-dependencies.jar \  
org.apache.giraph.GiraphRunner  org.data2semantics.giraph.pagerank.SimpleInDegreeCountComputation \  
-eif org.data2semantics.giraph.io.EdgeListReader  \  
-of org.apache.giraph.io.formats.IdWithValueTextOutputFormat \  
-op <outputPath> \  
-eip <input edge list> \  
-w <number of workers>

This computes the in-degree of every vertex in the input file <input edge list>, writes the output to the directory <outputPath>, and uses <number of workers> workers.
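To check the degree-count output on a small edge list, a single-machine reference is easy to write. The Python sketch below counts outgoing and incoming edges per vertex. It is an illustration only, not the Giraph computation in this repository; in particular, whether the Giraph jobs emit vertices with a degree of zero is not confirmed here, and this sketch assumes they are listed.

```python
from collections import Counter


def degree_counts(edges):
    """Return (out_degree, in_degree) dicts for a list of (source, target) pairs."""
    out_degree = Counter()
    in_degree = Counter()
    for source, target in edges:
        out_degree[source] += 1
        in_degree[target] += 1
        # Make sure every vertex appears in both tables, even with degree 0.
        out_degree[target] += 0
        in_degree[source] += 0
    return dict(out_degree), dict(in_degree)


edges = [("a", "b"), ("a", "c"), ("b", "c")]
out_degree, in_degree = degree_counts(edges)
for vertex in sorted(out_degree):
    print(f"{vertex}\tout={out_degree[vertex]}\tin={in_degree[vertex]}")
```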
