Skip to content
a Hadoop Map Reduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and creates a word cloud of most frequently occurring words. Python scripts are developed for gathering data and processing on a Hadoop MR infrastructure. Angular with D3.js is used to create an interactive web app …
Jupyter Notebook Python
Branch: master
Clone or download
Latest commit 71f0bb2 Oct 3, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
DataCollection Added code files Oct 3, 2019
MapperReducer Added code files Oct 3, 2019
LICENSE Initial commit Oct 3, 2019
README.md Update README Oct 3, 2019
Report.pdf Added code files Oct 3, 2019

README.md

SportsDataAnalysis

a Hadoop Map Reduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and creates a word cloud of most frequently occurring words. Python scripts are developed for gathering data and processing on a Hadoop MR infrastructure. Angular with D3.js is used to create an interactive web app that displays the word cloud:

Demo of webapp : https://dicpro-ychandra-mehulawa.herokuapp.com/

You can’t perform that action at this time.