Parallel Algorithms in Python for Hadoop/Mapreduce
Python Shell
Switch branches/tags
Nothing to show
Clone or download
Atbrox Merge pull request #1 from masayang/master
mrjob version
Latest commit 6885fbe Aug 10, 2012
Failed to load latest commit information.
README first commit Jun 13, 2010


Snabler - Parallel Algorithms in Python for Hadoop/Mapreduce

Contact for more info.

1. Which Algorithms are currently implemented in Snabler?
(So far) A Parallel Machine Learning Classifier for Hadoop Streaming in Python.

2. Which Algorithms will be implemented?
There is a potential backlog of hadoop/mapreduce algorithms here

3. Why the name Snabler?
The word Snabler is the Danish and Norwegian plural of an Elephant's Trunk (e.g. the Hadoop elephant), and shapewise referring to Python and plurality referring to parallelism.

4. Is Snabler open source?
Yes, Snabler is an open source project with an Apache Licence 2.0

5. Who is behind and develops Snabler?
Atbrox - a startup company that develops cloud & search software - is behind Snabler.

6. How do I contribute to Snabler?
Implement an algorithm in Python for Hadoop Streaming with methods def map(key,value) and def reduce(key,values), see this for an example implementation