Scripts for setting up training infrastructure
Heerkat (Hadoop Meerkat) is a Python framework for implementing and executing smoke tests that verify the correctness of your Hadoop cluster and tools like HDFS, Hive, Solr, HBase, Oozie and more.
Example application for Kafka Streams training
A CLI and Go client for Confluent's Kafka Schema Registry
Mirror of Apache Flink
Dockerfiles for Confluent Stream Data Platform
Skeleton for a Spark application with HiveContext and tests
HiBench is a Hadoop benchmark suite.