Little utility for downloading Kasabi datasets and uploading them to the Internet Archive
Python
Latest commit 25ad0da @edsu removed size sanity check
dataset first commit
.gitignore just one module now, added named graph upload, also ability to check …
README.md just one module now, added named graph upload, also ability to check …
datasets.csv
kasabi.py removed size sanity check
requirements.txt just one module now, added named graph upload, also ability to check …

README.md

kasabi-archive

It was announced that the Kasabi data publishing platform, created by Talis, would close on July 30, 2012. Although the service has only been around for about two years, it offers a unique look at services for Linked Data and hosts a variety of datasets. In a subsequent blog post, Kasabi announced a spreadsheet listing where the datasets can be downloaded from Amazon S3.

kasabi-archive is a little one-off utility for downloading Kasabi data from S3 and uploading it to the Internet Archive. Before you can run kasabi.py, you will need to get Internet Archive access keys and add them to a .boto file in your home directory that looks like this:

[Credentials]
ia_access_key_id=[your-internet-archive-access-key]
ia_secret_access_key=[your-internet-archive-secret-access-key]
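
If you want to confirm the file is in place before running anything, a check along these lines may help. This is only a sketch and is not part of kasabi.py: the helper name `read_ia_credentials` is hypothetical, and it assumes the file has exactly the section and key names shown above.

```python
import configparser
import os


def read_ia_credentials(path=None):
    """Return (access_key, secret_key) from a boto-style config file.

    Hypothetical helper for illustration; defaults to ~/.boto.
    """
    path = path or os.path.expanduser("~/.boto")
    # Disable interpolation so a '%' inside a key is read literally.
    parser = configparser.ConfigParser(interpolation=None)
    # ConfigParser.read() returns the list of files it managed to parse.
    if not parser.read(path):
        raise FileNotFoundError(f"no config file found at {path}")
    # A missing section or key raises KeyError, which names what is absent.
    section = parser["Credentials"]
    return section["ia_access_key_id"], section["ia_secret_access_key"]


if __name__ == "__main__":
    read_ia_credentials()
    print("Internet Archive credentials found")
```

A `KeyError` here usually means the `[Credentials]` header or one of the two key names is misspelled.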

Then follow these steps:

  1. pip install -r requirements.txt
  2. ./kasabi.py

The results are available at http://archive.org/details/kasabi

License

  • CC0