Master repository for hadoop jobs
This is the hadoop classic WordCount example, created with Matthias Friedrich's maven archetype for hadoop jobs. This project was created by following this blog, go there for details. Thanks Matthias.
To run copy to the cluster and execute:
hadoop jar hadoop-job1.jar org.javasucks.hadoop.WordCount /falcon/demo/bcp/processed/enron/2014-02-28-00 /user/hue/wordcount
A command line utility used to upload a local directory with small files in the local filesystem to a a single SequenceFile in a remote hdfs cluster.
To run java -jar uploadfiles.jar
.
To build and generate a single runnable jar file with dependencies run mvn clean package
.