Skip to content

temur-kh/big-data-homework-01

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Assignment №1. MapReduce. Simple Search Engine

Usage guide

The following is the command syntax for running Indexer:

$ hadoop/bin/hadoop jar <jar file name>.jar Indexer <path to corpus on HDFS> <output path on HDFS>
e.g.: $ hadoop/bin/hadoop jar IDB-HW1.jar Indexer /EnWikiSmall /indexer_results

Here is the command syntax for running Query:

$ hadoop/bin/hadoop jar <jar file name>.jar Query <path to Indexer output on HDFS> <query string inside quotes> <number of most relevant docs to show>
e.g: $ hadoop/bin/hadoop jar IDB-HW1.jar Query /indexer_results "some query" 3

About

Homework #1 of Introduction to Big Data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages