Skip to content
This repository has been archived by the owner on Mar 30, 2021. It is now read-only.

Home old

hbutani edited this page Aug 4, 2016 · 1 revision

#Fast BI using Spark and Druid.

This project is aimed at two classes of users

  1. Users of Druid who want SQL access to their indexes and use traditional BI tools such as Tableau with Druid
  2. Spark and Hive users who find performance of their interactive BI painfully slow.

Where to start.

  1. Quick Start
  2. Using Tableau with Sparkline
  3. The Druid project
  4. Spark

##Indexing

  1. Indexing TPCH data as an example.

  2. Setting up Druid Druid.

Setting up the data

  1. Sample data set for TPCH.

  2. Demo with sample dataset

##Querying data from Spark

  1. Setup thrift server connections so you can use Squirrel, Razor SQL, Zeppelin or Tableau against the datasets.

  2. Sample Queries.

Clone this wiki locally