Skip to content

A spark based twitter streaming application that was used to process valentines tweets from all over India during the Valentines week of 2016 for analyzing the cities and states of India in love

License

Notifications You must be signed in to change notification settings

sahilsareen/LoveIsInTheAir

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LoveIsInTheAir

A spark based twitter streaming application that was used to process valentines tweets from all over India during the Valentines week of 2016 for analyzing the cities and states of India in love!

This app live streams tweets from twitter and filters tweets from India, then determines the city from each tweet and accumulates city names and corresponding tweet counts for all the cities found. These tuples of city names and corresponding counts are stored in a text file for further analysis.

LoveIsInTheAir-Stats-Plotter is used to map each city to a state in India and plot a pie chart distribution of tweets for all Indian states.

Spark streaming UI

Results

  • 27.20 Million tweets from India processed during the valentines week of 2016.
  • State wise distribution of tweets:
  • Upon normalization of tweets by area of state, Delhi scores 89.704192
  • Only 0.0418588% of tweets from India were with a location embedded

Setup and HowTo

  1. Install sbt

  2. Clone LoveIsInTheAir: git clone https://github.com/sahilsareen/LoveIsInTheAir.git

  3. Create a new twitter app and generate an access token.

  4. Run cd LoveIsInTheAir && sbt package run <consumer key> <consumer secret> <access token> <access token secret> [<twitter love filters>]

  5. After collecting sufficient data, use LoveIsInTheAir-Stats-Plotter to visualize results.

Contributing

  1. Generate a pull request, OR
  2. Email patches to sahil [DOT] sareen [AT] hotmail [DOT] com

License

See License

Author

  • Sahil Sareen (sahil [DOT] sareen [AT] hotmail [DOT] com)

About

A spark based twitter streaming application that was used to process valentines tweets from all over India during the Valentines week of 2016 for analyzing the cities and states of India in love

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages