spark-samples-jeeconf-kyiv
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
api
spark-distributed-library
spark-driver
.gitignore
README.md
pom.xml

README.md

spark-samples-jeeconf-kyiv

Simple application to demonstrate features of Spark core and Spark SQL components.

Provides analytics related Morning@Lohika events:

  • unique participants by companies
  • most loyal participants
  • participants by position
  • etc.

Features:

  • simple HTTP-based API
  • file system: local and HDFS
  • data formats: CSV and Parquet
  • 3 compatible implementations based on: RDD (Spark Core), Data Frame DSL (Spark SQL), Data Frame SQL (Spark SQL)
  • serialization: default Java and Kryo

In case of any questions please contact me directly via taras.matyashovsky@gmail.com