The Kasabi data publishing platform created by Talis was announced to be closing on July 30, 2012. While the service has only been around for ~2 years it represents a unique look at services for Linked Data, and contains a variety of datasets. In a subsequent blog post Kasabi announced the availability of a spreadsheet that lists where datasets can be downloaded from Amazon S3.
kasabi-archive is a little one-off utility for downloading Kasabi data from s3 and putting it up at Internet Archive. Before you can run kasabi.py you will need to get Internet Archive access keys and add them to a .boto file in your home directory that looks like this:
[Credentials] ia_access_key_id=[your-internet-archive-access-key] ia_secret_access_key=[your-internet-archive-secret-access-key]
Then follow these steps:
- pip install -r requirements.txt
The results are available at http://archive.org/details/kasabi