A web-latency SQL spout for Hadoop.
Java JavaScript HTML Shell Other
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
assembly
splout-commons
splout-hadoop
splout-javaclient
splout-mysql
splout-redis
splout-resources
splout-server
.gitignore
CHANGES.txt
README.md
Splout_SQL.jpg
pom.xml

README.md

Splout SQL: A web-latency SQL spout for Hadoop.

Splout is a scalable, open-source, easy-to-manage SQL big data view. Splout is to Hadoop + SQL what Voldemort or Elephant DB are to Hadoop + Key/Value. Splout serves a read-only, partitioned SQL view which is generated and indexed by Hadoop. In a nutshell:

For up-to-date documentation and examples please visit the official website

Quick steps to compile it:

  1. Compile it adapting the different pom.xml to your Hadoop distro versions. Use "mr2" profile for MR2 versions.

mvn clean install -DskipTests

The default version compiles the distribution for MR2 versions (MapReduce 2), the most common use case. Other available maven profiles are:

  • mr1: MarReduce 1 version
  • cdh5: Cloudera CDH5 version

For example, for compiling for CDH5 version you have to launch:

mvn clean install -DskipTests -Pcdh5

If you need to compile against a particular Hadoop version, you'll need to modify the different pom.xml acordingly. You can use the cdh5 profile as reference.

  1. Once compiled you'll find the distribution binaries are at file assembly/target/splout-distribution-*.tar.gz ready for being installed.