Sean's learning-spark project
This repo contains various Spark projects I've created to help learn spark for myself, teach others, present, and other useful information I've accumulated.
exactlyonce project is a demonstration of implementing Kafka's Exactly Once message delivery semantics with Spark Streaming, Kafka, and Cassandra.
stackanalysis project analyzes StackOverflow.com post data to discover insights in regards to Scala questions asked on the site.
githubstream project consumes data directly from the public Github Events API and demonstrates some common streaming capabilities of Apache Spark.