Skip to content

A data pipeline with Docker to perform Sentiment Analysis on tweets and post it on a slack channel via a bot

License

Notifications You must be signed in to change notification settings

miladbehrooz/Dockerized_Data_Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Dockerized Data Pipeline for Sentiment Analysis on tweets

There are 5 steps in the data pipeline:

  • Extract tweets with Tweepy API
  • Loaded the tweets in a MongoDB
  • Extracted the tweets from MongoDB, performed sentiment analysis on the tweets, and loaded the transformed data in a PostgresDB (ETL job)
  • Loaded the tweets and corresponding sentiment assessment in a PostgresDB
  • Extracted the data from the PostgresDB and posted it in a slack channel with a Slack bot

workflow

Usage

  • Install Docker on your machine
  • Clone the repository: git clone https://github.com/miladbehrooz/Dockerized_Data_Pipeline.git
  • Get credentials for Twitter API and insert them in tweet_collector/credentials.py
  • Get credentials for Slack bot and insert them in slack_bot/credentials.py
  • Run docker-compose build, then docker-compose up in terminal

About

A data pipeline with Docker to perform Sentiment Analysis on tweets and post it on a slack channel via a bot

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published