Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

little utility for downloading kasabi datasets and uploading to Internet Archive

branch: master

Fetching latest commit…

Octocat-spinner-32-eaf2f5

Cannot retrieve the latest commit at this time

Octocat-spinner-32 dataset
Octocat-spinner-32 .gitignore
Octocat-spinner-32 README.md
Octocat-spinner-32 datasets.csv
Octocat-spinner-32 kasabi.py
Octocat-spinner-32 requirements.txt
README.md

kasabi-archive

The Kasabi data publishing platform created by Talis was announced to be closing on July 30, 2012. While the service has only been around for ~2 years it represents a unique look at services for Linked Data, and contains a variety of datasets. In a subsequent blog post Kasabi announced the availability of a spreadsheet that lists where datasets can be downloaded from Amazon S3.

kasabi-archive is a little one-off utility for downloading Kasabi data from s3 and putting it up at Internet Archive. Before you can run kasabi.py you will need to get Internet Archive access keys and add them to a .boto file in your home directory that looks like this:

[Credentials]
ia_access_key_id=[your-internet-archive-access-key]
ia_secret_access_key=[your-internet-archive-secret-access-key]

Then follow these steps:

  1. pip install -r requirements.txt
  2. ./kasabi.py

The results are available at http://archive.org/details/kasabi

License

  • CC0
Something went wrong with that request. Please try again.