A little utility for downloading Kasabi datasets and uploading them to the Internet Archive


It was announced that the Kasabi data publishing platform, created by Talis, will close on July 30, 2012. Although the service has only been around for about two years, it offers a unique look at services for Linked Data and contains a variety of datasets. In a subsequent blog post, Kasabi announced a spreadsheet listing where the datasets can be downloaded from Amazon S3.

kasabi-archive is a little one-off utility for downloading Kasabi data from S3 and uploading it to the Internet Archive. Before you can run it, you will need to get Internet Archive access keys and add them to a .boto file in your home directory that looks like this:
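
One way to create that file is sketched below in Python. It assumes boto's standard `[Credentials]` section with `aws_access_key_id` and `aws_secret_access_key` options. The helper name `write_boto_config` and the key values are hypothetical placeholders and are not part of kasabi-archive.

```python
import configparser
from pathlib import Path


def write_boto_config(access_key, secret_key, path=None):
    """Write a boto-style credentials file and return its path.

    Assumption: boto reads keys from a [Credentials] section with the
    option names used below. Any existing file at the target is overwritten.
    """
    # Default to ~/.boto, the per-user location mentioned in the text.
    target = Path(path) if path is not None else Path.home() / ".boto"

    config = configparser.ConfigParser()
    config["Credentials"] = {
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }

    with target.open("w") as handle:
        config.write(handle)

    # The file holds secrets, so limit it to the owner where the OS allows.
    target.chmod(0o600)
    return target
```

Calling `write_boto_config("YOUR_IA_ACCESS_KEY", "YOUR_IA_SECRET_KEY")` would write `~/.boto` with your Internet Archive keys substituted for the placeholders.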


Then follow these steps:

  1. pip install -r requirements.txt
  2. ./

The results are available at


  • CC0