Near-real-time (NRT) sessionization with Spark Streaming, landing data on HDFS and putting live stats in HBase
Based on the design of SparkOnHBase. This repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
Simple Spark example of generating table stats for use in data quality checks
Examples of integrating Spark Streaming, Flume, and HBase to solve streaming problems
Reads an HBase table and writes it out as Text, Seq, Avro, or Parquet