tumaolin94/HadoopInvertedIndex

HadoopInvertedIndex

  • Implements an inverted index on Hadoop: each word is mapped to the books that contain it and the number of times it occurs in each.

DataSet

Deployed on Google Cloud

  • Output format

Word BookName: count BookName: count ...

Example: a Henry James___Four Meetings: 438 Henry James___Georgina's Reasons: 1708

This means the word "a" occurs 438 times in the book Henry James___Four Meetings and 1708 times in the book Henry James___Georgina's Reasons.
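To illustrate the output format, here is a minimal in-memory sketch in plain Java with no Hadoop dependencies. It is not the repository's MapReduce code: the class name `InvertedIndexSketch`, the tokenization rule, and the sample books are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; not the repository's actual mapper/reducer.
public class InvertedIndexSketch {

    // Builds word -> (bookName -> count) from bookName -> text.
    static Map<String, Map<String, Integer>> build(Map<String, String> books) {
        Map<String, Map<String, Integer>> index = new TreeMap<>();
        for (Map.Entry<String, String> book : books.entrySet()) {
            // "Map" step: split the text into lowercase tokens (assumed tokenization).
            for (String token : book.getValue().toLowerCase().split("[^a-z0-9']+")) {
                if (token.isEmpty()) {
                    continue;
                }
                // "Reduce" step: accumulate the count for this (word, book) pair.
                index.computeIfAbsent(token, w -> new TreeMap<>())
                     .merge(book.getKey(), 1, Integer::sum);
            }
        }
        return index;
    }

    // Formats each entry as "Word BookName: count BookName: count ...".
    static List<String> format(Map<String, Map<String, Integer>> index) {
        List<String> lines = new ArrayList<>();
        for (Map.Entry<String, Map<String, Integer>> entry : index.entrySet()) {
            StringBuilder line = new StringBuilder(entry.getKey());
            for (Map.Entry<String, Integer> posting : entry.getValue().entrySet()) {
                line.append(' ').append(posting.getKey())
                    .append(": ").append(posting.getValue());
            }
            lines.add(line.toString());
        }
        return lines;
    }

    public static void main(String[] args) {
        Map<String, String> books = new TreeMap<>();
        books.put("Book A", "a cat and a dog");   // made-up sample data
        books.put("Book B", "a bird");
        for (String line : format(build(books))) {
            System.out.println(line);
        }
    }
}
```

With the sample data, the first printed line is `a Book A: 2 Book B: 1`, which has the same shape as the example above. In a real Hadoop job, the tokenizing loop would typically live in a mapper and the counting in a reducer.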

Local test method (program arguments):

<input_path>
<output_path>
-map <docID2Name_path>
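As a rough sketch of how these arguments could be validated before a job is launched, the following assumes they are passed positionally in the order listed, with `-map` as the third argument. The class `LocalArgsSketch` and its field names are hypothetical and do not come from the repository.

```java
// Hypothetical illustration; assumes the positional order shown above.
public class LocalArgsSketch {
    final String inputPath;
    final String outputPath;
    final String docIdToNamePath;

    LocalArgsSketch(String inputPath, String outputPath, String docIdToNamePath) {
        this.inputPath = inputPath;
        this.outputPath = outputPath;
        this.docIdToNamePath = docIdToNamePath;
    }

    // Expects exactly: <input_path> <output_path> -map <docID2Name_path>
    static LocalArgsSketch parse(String[] args) {
        if (args.length != 4 || !"-map".equals(args[2])) {
            throw new IllegalArgumentException(
                "usage: <input_path> <output_path> -map <docID2Name_path>");
        }
        return new LocalArgsSketch(args[0], args[1], args[3]);
    }

    public static void main(String[] args) {
        LocalArgsSketch parsed = parse(args);
        System.out.println("input:  " + parsed.inputPath);
        System.out.println("output: " + parsed.outputPath);
        System.out.println("map:    " + parsed.docIdToNamePath);
    }
}
```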
