Skip to content


Subversion checkout URL

You can clone with
Download ZIP
Fetching contributors…

Cannot retrieve contributors at this time

executable file 11 lines (7 sloc) 0.497 kB

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 30 times faster than Hive without modification to the existing data nor queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.2 requires Scala 2.9.2, Hive 0.9, and Spark 0.6.

For current documentation, see the Shark Project Wiki

Jump to Line
Something went wrong with that request. Please try again.