zohmg / zohmg

Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.

avtobiff (author)
Wed Aug 19 06:43:29 -0700 2009
commit  e8b962370e879d975969e15c8f100ada1af9ab8b
tree    69d62d2202ed5ec981c1ff85a94e5ba2dc16d350
parent  559df131b6c866f3e1bea72bab95520999bf20bf
zohmg / DEPENDENCIES
 
DEPENDENCIES
------------
 
Zohmg depends on a number of software packages and Python modules.
 
The installation procedure described in INSTALL will take care of
these dependencies for you, except for patched versions of
Hadoop and HBase. Please see scripts/install_hadoop_hbase.sh
for an assisted installation of Hadoop and HBase.
 
 
The following software is required:
 - Apache Hadoop 0.20 with patches HADOOP-1722 and HADOOP-5450
 - Apache HBase 0.20
 - Dumbo >=0.21.17
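
A minimum version such as "Dumbo >=0.21.17" has to be compared number by
number, not as a plain string (as strings, "0.21.9" sorts after "0.21.17").
The sketch below shows one way to do that comparison. The helper names
parse_version and meets_minimum are hypothetical and not part of Zohmg or
Dumbo, and the code assumes purely numeric, dot-separated version strings.

```python
def parse_version(text):
    """Turn a dotted version string such as "0.21.17" into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def meets_minimum(installed, minimum="0.21.17"):
    """Return True if `installed` is at least `minimum`, compared numerically."""
    return parse_version(installed) >= parse_version(minimum)


if __name__ == "__main__":
    # Hypothetical installed versions, used only for illustration.
    for candidate in ("0.21.9", "0.21.17", "0.22.0"):
        print(candidate, meets_minimum(candidate))
```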
 
These Python modules come bundled:
 - PyYAML
 - Python Paste
 - Python simplejson
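
To confirm that the bundled modules are visible to the Python interpreter
in use, something like the following sketch can help. It is not part of
Zohmg. The function name missing_modules is hypothetical, and the import
names (yaml, paste, simplejson) are the names under which these packages
are normally imported.

```python
import importlib

# Display name -> import name. PyYAML is imported as "yaml".
BUNDLED_MODULES = {
    "PyYAML": "yaml",
    "Python Paste": "paste",
    "Python simplejson": "simplejson",
}


def missing_modules(modules=BUNDLED_MODULES):
    """Return the display names of modules that cannot be imported."""
    missing = []
    for display_name, import_name in sorted(modules.items()):
        try:
            importlib.import_module(import_name)
        except ImportError:
            missing.append(display_name)
    return missing


if __name__ == "__main__":
    for name in missing_modules():
        print("missing:", name)
```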
 
 
 
# Dumbo (>=0.21.17)
 
    Dumbo is a convenient Python API for writing MapReduce programs.
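
    To give a feel for the API, here is a minimal word-count sketch in the
    style of the Dumbo tutorial. It is an illustration only and not code
    from Zohmg. The mapper and reducer are plain generator functions, and
    dumbo.run(mapper, reducer) hands them to Dumbo. The ImportError guard
    is an addition for this example, so that the two functions can be
    tried on a machine without Dumbo installed.

```python
def mapper(key, value):
    # For text input, `value` is one line of text; emit (word, 1) per word.
    for word in value.split():
        yield word, 1


def reducer(key, values):
    # `values` iterates over all counts emitted for the word `key`.
    yield key, sum(values)


if __name__ == "__main__":
    try:
        import dumbo  # only importable where Dumbo is installed
    except ImportError:
        dumbo = None
    if dumbo is not None:
        dumbo.run(mapper, reducer)
```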
 
    more:
     + http://dumbotics.com/
     + http://klbostee.github.com/dumbo/
     + http://wiki.github.com/klbostee/dumbo
 
# Apache HBase (0.20)
 
    HBase is a column-oriented data store modeled on Google's Bigtable.
 
    more:
     + http://hbase.org/
     + http://labs.google.com/papers/bigtable.html
 
# Apache Hadoop (0.20)
 
    Hadoop is a MapReduce framework.
 
    Additionally, the Hadoop source has to be patched with HADOOP-1722 and
    HADOOP-5450:
     + https://issues.apache.org/jira/browse/HADOOP-1722
     + https://issues.apache.org/jira/browse/HADOOP-5450
 
    more:
     + http://hadoop.org/
     + http://en.wikipedia.org/wiki/Hadoop
     + http://labs.google.com/papers/mapreduce.html
     + http://dumbotics.com/2009/02/24/hadoop-1722-and-typed-bytes/