Skip to content

Harshpatel44/Data-Analysis-On-News-And-Twitter-Data

Repository files navigation

Data-Analysis-On-News-And-Twitter-Data

This repository contains analysis on news and twitter data.

Final analysis is provided in report.pdf

Summary

1. I Extracted twitter and news data, performed data cleaning and stored in MongoDB.

2. Used map-reduce approach in Apache Spark, stored data in RDD and analysed frequency of phrases.

3. Used the twitter data to generate wordcloud using Tableu.

4. Technologies used: Python, MongoDB, Apache Spark, Tableu.

About

This repository contains analysis on news and twitter data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages