GitHub - sangheestyle/bison: Homework for HCC Big Data Fall semester 2013

#bison#

Homework for HCC Big Data Fall semester 2013

Goal

crawling data from tweet server via tweet API (Twython)
loading data from text files including json dump and save it to tsv (Pandas)
loading dataframe from json or tsv (Pandas)
TBD: giving some simple ways to analyze tweet data with tweet data fields (Pandas)
TBD: viaulizing data (Numpy, Google Chart)

Data Center

The data stored from query is saved periodically at following github repo from Amazon EC2
https://github.com/sangheestyle/datacenter-for-bision

Reference

Twitter REST API v1.1 Resources: https://dev.twitter.com/docs/api/1.1
Twython (Python implementation for Using Twitter REST API): https://github.com/ryanmcgrath/twython

Demo

Following demo visualizations allow you to know what this programe does eventually. It takes about 1 min to load graph because they use Google Chart for visualization.

Scene 2: Geo Chart - http://goo.gl/vyi1Gy
Scene 3: Hourly Trends with outliers - http://goo.gl/L26mmP
Scene 3: Hourly Trends without outliers - http://goo.gl/1yaekZ
Scene 4: Weekly Tweet Trends - http://goo.gl/Y28HrQ
Scene X1: Official Mobile Apps - http://goo.gl/OzYu0J

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
analyzer.py		analyzer.py
crawlingtweets.py		crawlingtweets.py
tweetdataframe.py		tweetdataframe.py
tweetdatastore.py		tweetdatastore.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases 2

Packages

Contributors 3

Uh oh!

Languages

sangheestyle/bison

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 3

Uh oh!

Languages

Packages