public
Description: Python module that allows you to easily write and run Hadoop programs.
Homepage: http://last.fm/dumbo
Clone URL: git://github.com/klbostee/dumbo.git
Klaas Bosteels (author)
Mon Nov 03 03:24:21 -0800 2008
commit  c35c9bb748b1ee2348a96d53c4c025dc0cbf425c
tree    99d0132f54b6032485ad4f4be568567c1b7c6f0d
parent  993d1a5ef6c11ba8d5a257d5b0b4cb8e80a15f95
dumbo /
name age message
file README Loading commit data...
file build.xml
directory examples/
directory src/
README
INSTALLATION
""""""""""""

python setup.py install


USAGE
"""""

/usr/local/hadoop/bin/hadoop dfs -put examples/brian.txt brian.txt

python examples/wordcount.py -hadoop /path/to/hadoop \
-file excludes.txt -input brian.txt -output brian-wc

/usr/local/hadoop/bin/hadoop dfs -getmerge brian-wc brian-wc.txt


MORE INFO
"""""""""

http://github.com/klbostee/dumbo/wikis