GitHub is home to over 40 million developers
use GitHub to host and review code, manage projects, and build software
together across more than 100 million repositories.
A framework for moving data into a data warehouse
Repo for the presentation Getting Started With SparklyR
Repo for Meet Up Titled: Reproducible Research with R, The Tidyverse, Jupyter, and Spark
Repo for the Kansas City Apache Spark Meetup entitled Perfecting Your Streaming Skills with Spark and Real World IoT Data
A sample ETL process with Python
Example code showing how to load a Type II SCD with T-SQL
Data files for Hands On: Introduction to the Hadoop Ecosystem
A SQL script that allows you to monitor how much data is getting loaded to your tables.