Svalbard

A global metadata vault for public domain datasets. A Dat Project initiative. Named after the Svalbard Global Seed Vault.

The target users for this information are archivists who want to coordinate on what they crawl and store. With this repository we hope to contribute to data backup efforts by collecting a "dataset of datasets" in one place.

Status

The Svalbard V1 release is out! You can download it with Dat here: https://datproject.org/de8cb55dcf2bee13b6cf86a6c4619f2368a66ffe0a0b270784bc386fcfa6ee70

In-progress sources are tracked in the issue tracker.

Current Sources

data.gov

Internet Archive

Using the data

You can use any tool that supports JSON Lines to analyze the data. Here is a tutorial.
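As a rough illustration of what working with JSON Lines looks like, here is a minimal Node.js sketch. It assumes only that each non-empty line of a file is one JSON value. The function names (`parseJsonLines`, `countRecords`) and the sample fields (`title`, `source`) are made up for this example; the real Svalbard records may use different field names.

```javascript
const fs = require('fs');
const readline = require('readline');

// Parse a JSON Lines string: one JSON value per non-empty line.
function parseJsonLines(text) {
  return text
    .split(/\r?\n/)
    .filter((line) => line.trim() !== '')
    .map((line) => JSON.parse(line));
}

// Stream a large JSON Lines file line by line instead of loading it all
// into memory, counting the records that match an optional predicate.
async function countRecords(path, predicate = () => true) {
  const rl = readline.createInterface({
    input: fs.createReadStream(path),
    crlfDelay: Infinity, // treat \r\n as a single line break
  });
  let count = 0;
  for await (const line of rl) {
    if (line.trim() === '') continue;
    if (predicate(JSON.parse(line))) count += 1;
  }
  return count;
}

// Hypothetical records for illustration only.
const sample =
  '{"title":"A","source":"data.gov"}\n' +
  '{"title":"B","source":"archive.org"}\n';

const records = parseJsonLines(sample);
console.log(records.length);
```

For a downloaded metadata file, something like `countRecords('path/to/file.jsonl', (r) => r.source === 'data.gov')` would count matching records without holding the whole file in memory. The path and the `source` field are placeholders, not names taken from the release.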