mapreduceTemplate

Template for running mapreduce jobs against Cloudera CDH4.4

Examples: WordCount - Classic example of counting occurrence of words within multiple files. Hive2Text - An example of outputting the content of rows in an external Hive table into a text file.

How to use/build locally:

Clone the repository
Import the directory as a maven project into Eclipse
Hack the examples into using local paths relative to the project root (as opposed to argument paths)

How to use/build for running on a hadoop cluster:

run: mvn assembly:assembly
copy (if necessary) the target/*-jar-with-dependencies.jar file to your hadoop box (where you'll be launching your job from)
as described in the examples, run hadoop jar <...-jar-with-dependencies.jar> <example class, eg: org.mapreduce.examples.WordCount>

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
src/main/java/org/mapreduce/examples		src/main/java/org/mapreduce/examples
.classpath		.classpath
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mapreduceTemplate

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mapreduceTemplate

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages