This repository is private.
All pages are served over SSL and all pushing and pulling is done over SSH.
No one may fork, clone, or view it unless they are added as a member.
Every repository with this icon (
) is private.
Every repository with this icon (
This repository is public.
Anyone may fork, clone, or view it.
Every repository with this icon (
) is public.
Every repository with this icon (
aviary /
| name | age | message | |
|---|---|---|---|
| |
.gitignore | Sat Mar 08 14:47:53 -0800 2008 | [mja] |
| |
MIT-LICENSE | Fri Mar 07 15:26:26 -0800 2008 | [Mark James Adams] |
| |
MOTIVATION | Fri Mar 07 14:54:12 -0800 2008 | [Mark Adams] |
| |
README | Sat Apr 19 10:12:25 -0700 2008 | [mja] |
| |
TODO | Sun Mar 09 14:10:49 -0700 2008 | [Geoff Cheshire] |
| |
assets/ | Sat Mar 08 16:53:04 -0800 2008 | [mja] |
| |
aviary.rb | Sat Apr 19 10:09:14 -0700 2008 | [mja] |
| |
cat_statuses.sh | Sat Mar 08 16:53:04 -0800 2008 | [mja] |
| |
config-example.yml | Sat Apr 19 09:59:16 -0700 2008 | [mja] |
| |
timeline.rb | Sat Mar 08 16:53:04 -0800 2008 | [mja] |
README
== Aviary, a Twitter Archiver Aviary is a simple Ruby script to retrieve and archive your tweets. Twitter "claim no intellectual property rights over the material you provide to the Twitter service," so you should have a way of storing and repurposing your data. == Requirements Aviary requires the Hpricot and builder gems. == Running Aviary First, copy the config-example.yml to config.yml, then fill in your username and password. Run Aviary with: $ ruby aviary.rb --updates [new|all] [--page XXX] Aviary will create a directory called USERNAME and begin parsing your Twitter archive and downloading the raw XML representation of each tweet. The "--updates all" option will parse and save every tweet, while "--updates new" will stop parsing your online archive when it encounters a tweet that's already been downloaded. Optionally, specify a page number to retrieve tweets starting from that page. This is useful when restarting the script after a timeout. Currently Aviary will not use your password to authenticate requests, so your timeline will need to be public. This is to get around the API request limits. Use the cat_statuses.sh script to combine all downloaded tweets for a user into one statuses XML file. $ SCREEN_NAME=username ./cat_statuses.sh > username.xml will read in all tweets in the username directory and concatenate them together in a file called username.xml. == Creating an events timeline Assuming you have already put all of your status files into a single XML file using cat_statuses.sh, you can transform it into a SIMILE timeline (http://simile.mit.edu/timeline/) using the assets/simili.xsl transformation stylesheet. View the data by modifying assets/events_template.html, replacing "screenname_events.xml" (Line 66) with the filename of your transformed statuses XML document. This can also be done automatically using the timeline.rb script: $ ruby timeline.rb -u USERNAME which will read in single tweets from the USERNAME's directory and construct an events timeline (USERNAME_events.xml) in the timelines directory. == What next? Now your data is yours! Start thinking about your own way to utilize your own data. Start thinking about your children's children's children. Will they be accessing the possibly non-existent http://twitter.com or would they prefer a paper record of "what you were doing"?




