Block or report user
  • Cloudera
  • Simple Spark example of generating table stats for use of data quality checks

    Scala 15 17 Updated Apr 28, 2017
  • Fast scalable time series database

    Java 230 Updated Jan 22, 2017
  • Simple Example of HBase, SolR, and Kudu for Entity 360 using NY taxi data

    Scala 1 4 Updated Sep 27, 2016
  • An example of how to do a merge sort

    Scala Updated Sep 15, 2016
  • This tool is designed to look through your HDFS folders to ether identify files with no data in them or delete files with no data in them.

    Scala 2 1 Updated Aug 31, 2016
  • This project is a collection of Spark Unit Tests Examples to help new Spark users have good examples on how to unit start their code for Spark Core, Spark SQL, and Spark Streaming

    Scala 7 4 Updated Jul 20, 2016
  • Examples for training

    Scala 1 3 Updated Jul 19, 2016
  • A tool to figure out when to grow or shrink a cluster

    Java Updated Jul 12, 2016
  • This is a demo/training application. Used to show how easy it is to do operations like ingestion, aggregation, and change data capture. Using tools like Kafka, Spark Streaming, Flume, Kudu, SolR, HBase, and HDFS

    Scala 1 2 Updated Jul 5, 2016
  • Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.

    Scala 37 44 Updated May 19, 2016
  • Scala 6 9 Updated Feb 10, 2016
  • FooBar

    Scala 3 Updated Nov 12, 2015
  • HBase.MCC (HBase Multi Cluster Client). The goal is to support aways up solutions with HBase through multiple clusters

    Java 8 7 Updated Nov 9, 2015
  • Just for Fun do not use in the real world. :)

    Java 1 Updated Sep 25, 2015
  • kite

    Forked from kite-sdk/kite

    Kite SDK

    Java 197 Updated Jan 6, 2015
  • This is an example of how to do window analysis with Spark

    Scala 2 1 Updated Nov 24, 2014
  • NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase

    Scala 45 42 Updated Oct 31, 2014
  • The ability to rebalance on clusters that have HBase by selecting folders to rebalance

    Java Updated Oct 8, 2014
  • Support to write Seq Files with Spark Streaming with similar functionality as Flume HDFS Sink with Seq Files

    Scala Updated Sep 21, 2014
  • Java 22 Updated Sep 5, 2014
  • spark

    Forked from apache/spark

    Mirror of Apache Spark

    Scala 12,237 Updated Aug 1, 2014
  • Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...

    Java 6 11 Updated Jul 9, 2014
  • This is a FixedLengthInputFormat for Hadoop map reduce.

    Java 1 1 Updated Jul 6, 2014
  • This is an example of how to make Unique Sequences in a distributed way with Spark (No dups, No Skips)

    Java 1 1 Updated Jul 3, 2014
  • Just some example of using GraphX

    Scala 2 Updated Jul 1, 2014
  • A simple example of using Giraph to root nodes in a tree

    Java 2 Updated Jun 29, 2014
  • A simple program to put files from a directory into HDFS with the added functionality and defining how that action will happen

    Java 1 1 Updated Jun 25, 2014
  • This will do a Merge Join of absolute Sorted data any number of files of ether side.

    Java Updated Jun 17, 2014
  • This will contain implementations that will copy records from a table with less regions then the final table.

    Java 1 Updated May 28, 2014
  • Java 2 Updated May 22, 2014