Python Streaming Sample
-----------------------

Slides from my NYC Hadoop Meetup talk: http://www.slideshare.net/charmalloc/hadoop-streaming-tutorial-with-python

Blog post that this sample code accompanies: http://allthingshadoop.com/2011/12/16/simple-hadoop-streaming-tutorial-using-joins-and-keys-with-python/

How to run this sample
----------------------

./run.sh

or, to run the mapper and reducer locally through a shell pipeline:

cat countries.dat customers.dat | ./smplMapper.py | sort | ./smplReducer.py
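
The pipeline above follows the usual streaming pattern: the mapper tags each input line with a join key, sort brings lines with the same key together, and the reducer combines them. The sketch below imitates those three stages in a single Python process. It is not the code in smplMapper.py or smplReducer.py. The record layouts (name|code for countries, person|type|code for customers) and the function names map_line, reduce_lines and run_pipeline are assumptions made only for illustration.

```python
from itertools import groupby


def map_line(line):
    """Tag a record with its join key. The pipe-delimited layouts are assumed."""
    fields = line.strip().split("|")
    if len(fields) == 2:
        # Assumed country record: name|code
        name, code = fields
        # The "0" tag makes the country line sort ahead of its customers.
        return f"{code}\t0\t{name}"
    if len(fields) == 3:
        # Assumed customer record: person|type|code
        person, kind, code = fields
        return f"{code}\t1\t{person}\t{kind}"
    return None  # skip lines that match neither assumed layout


def reduce_lines(sorted_lines):
    """Join customers to their country name, one key group at a time."""
    rows = (line.split("\t") for line in sorted_lines)
    for code, group in groupby(rows, key=lambda row: row[0]):
        country = None
        for row in group:
            if row[1] == "0":
                country = row[2]
            else:
                # Fall back to the raw code if no country line was seen.
                yield f"{country or code}\t{row[2]}\t{row[3]}"


def run_pipeline(lines):
    """In-process stand-in for: cat ... | mapper | sort | reducer."""
    mapped = [m for m in (map_line(line) for line in lines) if m is not None]
    return list(reduce_lines(sorted(mapped)))


if __name__ == "__main__":
    sample = [
        "United States|US",
        "Canada|CA",
        "Alice|gold|US",
        "Bob|silver|CA",
    ]
    for joined in run_pipeline(sample):
        print(joined)
```

The reducer only needs to remember one country name at a time, because sorting guarantees that each key's country line arrives before that key's customer lines.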