This repository is private.
All pages are served over SSL and all pushing and pulling is done over SSH.
No one may fork, clone, or view it unless they are added as a member.
Every repository with this icon (
) is private.
Every repository with this icon (
This repository is public.
Anyone may fork, clone, or view it.
Every repository with this icon (
) is public.
Every repository with this icon (
Klaas Bosteels (author)
Mon Jan 05 10:01:23 -0800 2009
commit 90afb019792f37fe743efa4174483d2d33d5da9c
tree 2cd49a4ce0afb66e9131a6a8e315cd7ef5c643b5
parent 67b8fb44e5a3c1d9a7b9bd88342d69a94d074b42
tree 2cd49a4ce0afb66e9131a6a8e315cd7ef5c643b5
parent 67b8fb44e5a3c1d9a7b9bd88342d69a94d074b42
dumbo /
| name | age | message | |
|---|---|---|---|
| |
README | Fri Dec 26 03:51:46 -0800 2008 | |
| |
bin/ | Fri Dec 26 03:51:46 -0800 2008 | |
| |
build-pymod.xml | Tue Nov 04 03:28:22 -0800 2008 | |
| |
build.xml | Fri Dec 26 03:18:58 -0800 2008 | |
| |
examples/ | Sat Dec 27 02:50:45 -0800 2008 | |
| |
src/ |
README
DESCRIPTION """"""""""" Originally, Dumbo was just a simple Python module that made writing and running Streaming programs very easy, but now it also consists of some helper code in Java. More generally, Dumbo can be considered to be a convenient Python API for writing MapReduce programs. INSTALLATION """""""""""" Dumbo should get built together with the rest of Hadoop when the "dumbo/" directory is put in Hadoop's "src/contrib/" directory. More precisely, a "build/hadoop-*/contrib/dumbo/" directory should be generated when you run "ant package" in Hadoop's root directory. USAGE """"" contrib/dumbo/bin/put examples/brian.txt brian.txt contrib/dumbo/bin/start examples/wordcount.py \ -input brian.txt -output brian-wc -inputformat text contrib/dumbo/bin/cat brian-wc > brian-wc.txt MORE INFO """"""""" http://github.com/klbostee/dumbo/wikis








