Skip to content

Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the Hadoop echosystem such as Flume, Kafka, and Spark Streaming.

Notifications You must be signed in to change notification settings

dsu4rez/bigdata-realtime-twitter-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bigdata-realtime-twitter-analysis

Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the Hadoop echosystem such as Flume, Kafka, and Spark Streaming.

Project Architecture

If you want to know more about this project, you can see a detailed explanation in my personal blog the docs folder of this repo.

This Git repository contains:

  • Flume

    • Custom Components to Stream and Filter Tweets (Custom Event Driven Source and Custom Interceptor)
    • Flume Agent Configuration File
  • Spark Streaming: Code in Python for a real-time Twitter sentiment analysis using TextBlob (PyCharm Project)

  • Dashboard: Flask Web Application for displaying the results using Chart.js (PyCharm Project)

  • Docs: containing originial posts published on my old blog.

About

Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the Hadoop echosystem such as Flume, Kafka, and Spark Streaming.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages