In this project, we generate some analytics on twitter tweet data, regarding US elections tweets.
We parse the JSON data and extract the tweet data, partition the data into groups, count the number of posts from each partition and finally find popular tokens the each partition's tweets.
This project is part of Big Data Analytics using Spark course from edx.com.
Python Jupyter Notebook
- run as a regular Jupyter notebook
credit to Big Data Analytics using Spark on edx.com