Hadoop PageRank

PageRank algorithm implementation which make use of the Apache Hadoop framework.

Install Hadoop on your machine [OSX], [Linux]
Pick a dataset from the Stanford web graphs collection
Place the dataset in your Hadoop FS
Create the directory which will contain the output
Build a JAR using this source code and name it pagerank.jar
Launch the software using Hadoop: hadoop jar pagerank.jar --input <in> --output <out>
Browse the PageRank output result which can be found in the Hadoop FS

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src/it/uniroma1/hadoop/pagerank		src/it/uniroma1/hadoop/pagerank
README.md		README.md

Provide feedback