Skeleton project for Airflow training participants to work on.
Provision the training environment (right now only for the Data Science with Spark on Dataproc trainings).
from python import wat?!
Python interface to Hive and Presto.
A utility tool to automate certain tasks with Jupyter notebooks.
a python grammar for evolutionary algorithms and heuristics
A selection of notebooks coming from the GoDataDriven trainings
Genetic algorithms and the game of Risk
Extract data from JIRA through REST and create charts.
Ansible scripts to create a Druid cluster
Example project demonstrating easy, concise and typechecked JDBC access
Material for PyData Code Breakfast: Introduction to Deep Learning
Repository for provisioning a GDD HDInsight cluster.
The iterative broadcast join example code.
Balancing Heroes and Pokemon in Real Time: A Streaming Variant of Trueskill for Online Ranking
Scripts to provision NiFi to HDInsight
Hadoop smoke testing framework
Azure Quickstart Templates
Sources to our blog
repo to demonstrate streaming game imbalance
Example of how to use Flink with Kafka
bigger simulations = moar profit