Skip to content

Latest commit

 

History

History
28 lines (17 loc) · 1.22 KB

README.md

File metadata and controls

28 lines (17 loc) · 1.22 KB

Splout SQL: A web-latency SQL spout for Hadoop.

Splout is a scalable, open-source, easy-to-manage SQL big data view. Splout is to Hadoop + SQL what Voldemort or Elephant DB are to Hadoop + Key/Value. Splout serves a read-only, partitioned SQL view which is generated and indexed by Hadoop. In a nutshell:

For up-to-date documentation and examples please visit the official website

Quick steps to compile it:

  1. Compile it adapting the different pom.xml to your Hadoop distro versions. Use "mr2" profile for MR2 versions.

mvn clean install -DskipTests

The default version compiles the distribution for MR2 versions (MapReduce 2), the most common use case. Other available maven profiles are:

  • mr1: MarReduce 1 version
  • cdh5: Cloudera CDH5 version

For example, for compiling for CDH5 version you have to launch:

mvn clean install -DskipTests -Pcdh5

If you need to compile against a particular Hadoop version, you'll need to modify the different pom.xml acordingly. You can use the cdh5 profile as reference.

  1. Once compiled you'll find the distribution binaries are at file assembly/target/splout-distribution-*.tar.gz ready for being installed.