Skip to content

ashutosh091/hadoopmr

Repository files navigation

hadoopmr

Hadoop and Map Reduce Code

Student Times Exercise - The solution is in folder "activeHours"

Post and Answer Length Exercise - The solution is in folder "postSize"

Top Tags Exercise - The solution is in folder "topTags"

Study Groups Exercise - The solution is in folder "studyGroup"

Search improvement question - Implemented the solution in folder "reverse_index"

Other ideas about the Forum datasets - Implemented two suggestions, active_threads -> finds which threads were active (new question added, new replies/comments) in last 24 hours. hottest_threads -> threads which have more than x number of replies/comments (in mapper.py I have set it to 10 but we can change it)

About

Hadoop and Map Reduce Code

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published