Skip to content

umbertogriffo/bigdata-notebook

 
 

Repository files navigation

Hadoop and ML repository

A repository to hold all my Hadoop and Machine Learning related codes.

Visit my blog at : www.vishnuviswanath.com

Contents

  1. Flink Streaming
  2. Spark ML, Streaming, SQL and GraphX
  3. Kafka Streams
  4. StormKafka streaming application POC
  5. Flume custom source and config files
  6. Hadoop MapReduce old api joins,custom types etc
  7. Solutions for kaggle problems using numpy or graphlab

Releases

No releases published

Packages

No packages published

Languages

  • Scala 52.1%
  • Java 36.2%
  • Python 11.0%
  • Shell 0.7%