Save and export historical tweets using Google Realtime
Switch branches/tags
Nothing to show
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.


Pirate API for Google Realtime to liberate historical twitter data saved there into sqlite database and CSV format.  It just crawls through all the required web pages and parses the necessary data out of them.

- python 2.6+
- sqlalchemy
- lxml

1. Edit, replace 'initURL' with the URL of the first google realtime result page you want to hit, and 'endtime' for the datetime of the last tweet you want saved.
2. Run, should create sqlite database.db.
3. Run, it'll slowly hit google and fill the db with page HTML.  Can be re-run without losing it's place.
4. Run, it'll parse out tweets from all the pages you saved and save them plaintext in the db.
5. Run, it'll create tweets.csv with plain text tweets and dates.