Set up:
- Download your key pair's .pem file from AWS EC2
- Set your AWSAccessKeyId and AWSSecretKey in the config.properties file
- Open start-cluster.sh and update your security-group and keyPair values
- Open my-mapreduce.sh and update your keyPair value
- All map-reduce programs are in the Hadoop/src/server folder
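
Before starting a cluster, it can help to confirm that both credentials are actually filled in. The following is a minimal sketch, not part of the project: `check_config` is a hypothetical helper, and it assumes config.properties uses plain `key=value` lines.

```shell
#!/bin/sh
# check_config FILE
# Hypothetical helper (not shipped with this project).
# Assumes FILE contains plain key=value lines.
check_config() {
  file="$1"
  if [ ! -r "$file" ]; then
    echo "cannot read: $file" >&2
    return 1
  fi
  # Each key must be present and followed by a non-empty value.
  for key in AWSAccessKeyId AWSSecretKey; do
    if ! grep -Eq "^${key}[[:space:]]*=[[:space:]]*[^[:space:]]" "$file"; then
      echo "missing or empty: $key" >&2
      return 1
    fi
  done
  echo "config ok"
}

# Usage, from the folder that holds the file:
#   check_config config.properties
```

The function only checks that values are present; it does not verify that the credentials are valid with AWS.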
Instructions to run in distributed mode:
- ./start-cluster.sh <No. of instances> distributed
- ./my-mapreduce.sh wordcount s3:// s3:// distributed
- ./stop-cluster.sh
Instructions to run in pseudo-distributed mode:
- ./start-cluster.sh <No. of instances> pseudo
- ./my-mapreduce.sh wordcount s3:// s3:// pseudo
- ./stop-cluster.sh
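
The three steps for either mode can be chained so that the cluster is stopped even when the job fails. This is a rough sketch under stated assumptions: `run_job` is a hypothetical wrapper, it must be run from the folder containing the three scripts, and the two S3 arguments are passed through unchanged in the order shown above (their meaning, presumably input and output locations, is an assumption).

```shell
#!/bin/sh
# run_job MODE INSTANCES FIRST_S3_URI SECOND_S3_URI
# Hypothetical wrapper (not shipped with this project).
# MODE is "distributed" or "pseudo", as in the instructions above.
run_job() {
  mode="$1"
  instances="$2"
  first_uri="$3"    # passed as the first s3:// argument
  second_uri="$4"   # passed as the second s3:// argument

  case "$mode" in
    distributed|pseudo) ;;
    *)
      echo "mode must be 'distributed' or 'pseudo'" >&2
      return 2
      ;;
  esac

  # Give up early if the cluster could not be started.
  ./start-cluster.sh "$instances" "$mode" || return 1

  # Run the job and remember its exit status.
  ./my-mapreduce.sh wordcount "$first_uri" "$second_uri" "$mode"
  status=$?

  # Always stop the cluster so that instances are not left running.
  ./stop-cluster.sh

  return "$status"
}

# Usage (fill in your own S3 locations):
#   run_job pseudo 1 s3:// s3://
```

Returning the job's exit status after stopping the cluster lets a calling script tell a failed job apart from a successful one.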