a Map/Reduce framework for distributed computing
Python JavaScript Erlang Other
Pull request Compare This branch is 1 commit ahead, 3040 commits behind discoproject:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
aws
bin
conf
doc
examples/datamining
ext
master
node
pydisco
test
tests
util
.gitignore
AUTHORS
LICENSE
Makefile
README.rst

README.rst

Disco - Massive data, Minimal code

Disco is an implementation of the Map-Reduce framework for distributed computing. As the original framework, which was publicized by Google, Disco supports parallel computations over large data sets on unreliable cluster of computers. This makes it a perfect tool for analyzing and processing large datasets without having to bother about difficult technical questions related to distributed computing, such as communication protocols, load balancing, locking, job scheduling or fault tolerance, which are taken care by Disco.

See discoproject.org for more information.