#bison#
Homework for HCC Big Data Fall semester 2013
Goal
- crawling data from tweet server via tweet API (Twython)
- loading data from text files including json dump and save it to tsv (Pandas)
- loading dataframe from json or tsv (Pandas)
- TBD: giving some simple ways to analyze tweet data with tweet data fields (Pandas)
- TBD: viaulizing data (Numpy, Google Chart)
Data Center
- The data stored from query is saved periodically at following github repo from Amazon EC2
- https://github.com/sangheestyle/datacenter-for-bision
Reference
- Twitter REST API v1.1 Resources: https://dev.twitter.com/docs/api/1.1
- Twython (Python implementation for Using Twitter REST API): https://github.com/ryanmcgrath/twython
Demo
Following demo visualizations allow you to know what this programe does eventually. It takes about 1 min to load graph because they use Google Chart for visualization.
- Scene 2: Geo Chart - http://goo.gl/vyi1Gy
- Scene 3: Hourly Trends with outliers - http://goo.gl/L26mmP
- Scene 3: Hourly Trends without outliers - http://goo.gl/1yaekZ
- Scene 4: Weekly Tweet Trends - http://goo.gl/Y28HrQ
- Scene X1: Official Mobile Apps - http://goo.gl/OzYu0J