Skip to content
This repository has been archived by the owner on Nov 19, 2022. It is now read-only.
/ sentiment-detect Public archive

Monitor Twitter stream and identify on unexpected increases in tweet volume

Notifications You must be signed in to change notification settings

iyunbo/sentiment-detect

Repository files navigation

Monitor Twitter stream and identify on unexpected increases in tweet volume

This is WIP. Getting Started

Current infrastructure:

  • TweetCollector serializes Tweets (without code generation) to Avro and sent to Kafka
  • TweetAnalyzer picks up serialized Tweets and monitor tweets for unexpected volume in Spark
  • Volume thresholds and detected alerts managed in HDFS

image

Getting started

  1. Get Twitter credentials and fill them in reference.conf.example and rename to reference.conf

  2. Start Kafka (instructions) in single-node mode on localhost

  3. Start TweetCollector

./gradlew collect 

This will start to read recent tweets, encode them to Avro and send to the Kafka cluster in binary format (Array[Byte]).

  1. Start TweetAnalyzer
 ./gradlew analyze

This will run Spark streaming connected to the Kafka queue. In 5-second intervals the program reads tweets from Kafka, analyzes the tweet texts and print the 10 most tweeted company of the 400 S & P

About

Monitor Twitter stream and identify on unexpected increases in tweet volume

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published