Few experiments using Kafka and Spark. The aim is to process stream of data taken from Twitter and exploit Kafka's consumers for providing some insights:
- CountingLang.scala: it counts the most popular languages from Twitter's users.
- CountingSource.scala: it counts the most popular platform (iPhone, iPad, Android, Web etc).
- CountingHashtags: it counts the most popular hashtags
- STData Labs: real time sentimental analysis from Twitter
- Twitter API [documentation] (https://developer.twitter.com/docs)
- Publishing with Kafka at NYT
- Netflix tech blog, Kafka inside Keystone pipeline