Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset Containers #8

Open
hosh opened this issue Mar 13, 2015 · 7 comments
Open

Dataset Containers #8

hosh opened this issue Mar 13, 2015 · 7 comments

Comments

@hosh
Copy link
Contributor

hosh commented Mar 13, 2015

  1. Make a data container with IPFS, load up data.
  2. Start container elsewhere.
  3. Boom, access to data!

Overview

This is the most useful for open data. Examples:

  1. http://www.fastonline.org/CD3WD_40/CD3WD/INDEX.HTM CD3WD. This is a CD-ROM containing 900+ publications for basic infrastructure technology designed to be distributed for 3rd world countries. This library could be reformatted to work with a web. We can add an nginx server or something similar: startup this container via IPFS, and you now have access to this library.
  2. https://www.govtrack.us/developers/data GovTrack publishes raw data from US Congress, specifically related to legislations that are being created or debated. Because of the size of the data, GovTrack has only a single firehose to PopVox. Can take a snapshot of this and make this available, or better yet, have GovTrack publish this data. [This might be too big to function as a good demo for this]
  3. http://www.nyc.gov/html/doitt/html/open/data.shtml New York City open data
  4. https://www.kickstarter.com/projects/publicresource/public-safety-codes-of-the-world-stand-up-for-safe US Public Safety codes. These are publications painstakingly created and put out into the public. He has other data here: https://public.resource.org/
  5. Zipcode geolocation dataset
  6. International Timezone dataset
  7. OpenStreetMap dataset?
@jbenet
Copy link
Contributor

jbenet commented Mar 14, 2015

  • ipfs mount (fuse in the container -- probably need to run dockerd in privileged mode)
  • prepared datasets

@hosh
Copy link
Contributor Author

hosh commented Mar 15, 2015

@jbenet do you think there is a way to have the dataset inside the container?

Though that might be a stretch, carting it around. As a seed it would work very well though. Hmm.

@jbenet
Copy link
Contributor

jbenet commented Mar 15, 2015

@hosh oh yeah we could ship a container with a node initialized and the data already added to the container. We'd just have to generate an ephemeral ID on first run (so multiple containers have different IDs).

@hosh
Copy link
Contributor Author

hosh commented Mar 15, 2015

@jbenet roger

@jbenet
Copy link
Contributor

jbenet commented Mar 15, 2015

@hosh another way would be to just issue an ipfs pin add -r command through the http api after the container boots. in that case the data would have to be somethere already though.

@hosh
Copy link
Contributor Author

hosh commented Mar 16, 2015

@jbenet I didn't know the http API lets you issue ipfs commands. Huh.

@jbenet
Copy link
Contributor

jbenet commented Mar 16, 2015

Yeah that's how the cli issues commands when daemon is on. (And why the api is separate from gateway route, so we can expose gateway without exposing api)


Sent from Mailbox

On Mon, Mar 16, 2015 at 11:03 AM, Ho-Sheng Hsiao notifications@github.com
wrote:

@jbenet I didn't know the http API lets you issue ipfs commands. Huh.

Reply to this email directly or view it on GitHub:
#8 (comment)

@jbenet jbenet mentioned this issue Mar 30, 2015
52 tasks
@jbenet jbenet mentioned this issue Apr 6, 2015
42 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants