🔍PageRank

Setup

Place the adjacency list in the input folder:

hadoop fs -put _movies/graph/adj_list /input

Commands

Run the MapReduce algorithm and print its output (use the adj file for smaller tests):

hadoop jar hadoop-streaming.jar -input /input/adj_list -output /out -mapper ./mapper.py -reducer ./reducer.py && hadoop fs -cat /out/part-00000

Show the top 20 results:

hadoop jar hadoop-streaming.jar -input /out/part-00000 -output /maxs -mapper ./max_page_rank.py && hadoop fs -cat /maxs/part-00000
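
The contents of mapper.py and reducer.py are not shown here. As a rough illustration of how one PageRank iteration can be split into a map step and a reduce step, the sketch below runs both in memory. The record layout (node, rank, comma-separated out-links, separated by tabs), the damping factor of 0.85, and the names map_line, reduce_node and iterate are all assumptions made for this example, not taken from the repository.

```python
from collections import defaultdict

DAMPING = 0.85  # assumed damping factor; the README does not state one


def map_line(line):
    """Emit (key, value) pairs for one adjacency-list record.

    Assumed record layout: node <TAB> rank <TAB> comma-separated out-links.
    """
    node, rank, links = line.rstrip("\n").split("\t")
    neighbors = [n for n in links.split(",") if n]
    # Re-emit the graph structure so a later iteration can reuse it.
    yield node, ("links", neighbors)
    # Share the node's current rank equally among its out-links.
    for neighbor in neighbors:
        yield neighbor, ("rank", float(rank) / len(neighbors))


def reduce_node(node, values, n_nodes):
    """Combine everything received by one node into its new rank."""
    neighbors, incoming = [], 0.0
    for kind, payload in values:
        if kind == "links":
            neighbors = payload
        else:
            incoming += payload
    new_rank = (1 - DAMPING) / n_nodes + DAMPING * incoming
    return node, new_rank, neighbors


def iterate(lines):
    """Run one map -> shuffle -> reduce pass in memory."""
    grouped = defaultdict(list)
    for line in lines:
        for key, value in map_line(line):
            grouped[key].append(value)
    n_nodes = len(grouped)
    return [reduce_node(node, grouped[node], n_nodes) for node in sorted(grouped)]
```

In a Hadoop Streaming job the grouping done by iterate would be handled by the shuffle phase, and the two functions would read from standard input and write tab-separated lines to standard output instead of yielding tuples.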

Top 20 results

After the first iteration:

1138    0.000126
1139    0.000126
1140    0.000126
1141    0.000126
1142    0.000126
1143    0.000126
1170    0.000123
1197    0.000126
1198    0.000126
1199    0.000126
12      0.000134
1200    0.000126
1209    0.000126
1210    0.000126
3701    0.000147
3966    0.000120
4       0.000120
524     0.000120
5322    0.000128
93      0.000183
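
How max_page_rank.py picks these entries is not shown. One simple way to select the highest-ranked nodes from output like the above is sketched here, assuming each line holds a node id followed by its rank, separated by whitespace. The function name top_ranks is hypothetical.

```python
import heapq


def top_ranks(lines, n=20):
    """Return the n (node, rank) pairs with the highest rank.

    Assumed record layout: node <whitespace> rank, with optional further fields.
    """
    pairs = []
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        pairs.append((fields[0], float(fields[1])))
    # nlargest keeps only the n best pairs, ordered from highest rank down.
    return heapq.nlargest(n, pairs, key=lambda pair: pair[1])
```

For example, top_ranks(["12\t0.000134", "3701\t0.000147", "93\t0.000183"], n=2) returns the entries for nodes 93 and 3701, in that order.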

antoinewg/ocr-page-rank
