Skip to content
This repository


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to either the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.9.0 requires:

  • Scala 2.10.3
  • AMPLab's Hive 0.11
  • Spark 0.9.x

For current documentation, see the Shark Project Wiki

Something went wrong with that request. Please try again.