This repo requires the following to run:
- Node.js
- Docker and docker-compose
- Python
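Before continuing, it may help to confirm that the tools listed above are on your PATH. The following is a minimal sketch, not part of this repo: the helper name check_tools is hypothetical, and the command names (node, npm, docker, docker-compose, python, pip) are assumptions that may differ on your system (for example, Python may be installed as python3).

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): prints the names of any
# tools from its argument list that cannot be found on PATH.
check_tools() {
    missing=""
    for tool in "$@"; do
        # command -v succeeds only if the tool can be resolved
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    # Strip the leading space left by the concatenation above
    echo "${missing# }"
}

# Assumed command names; adjust to match your installation.
missing_tools=$(check_tools node npm docker docker-compose python pip)
if [ -n "$missing_tools" ]; then
    echo "Missing tools: $missing_tools"
else
    echo "All required tools found."
fi
```

If anything is reported missing, install it (or adjust the command name) before running the steps below.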
First, clone this repo:
git clone https://github.com/duong19/Twitter-Analysis.git
Then install the Python and Node.js dependencies:
cd Twitter-Analysis
pip install -r requirements.txt
cd streaming
npm install
Build and start the Docker containers:
docker-compose up --build
Open the PySpark notebook and run the file Streaming Tweets in Spark.ipynb, then run Streaming Tweets from Hadoop.ipynb
Go to this URL to see your graphs