[public record] Tell the background story of public record for "Rescued" data #3

flyingzumwalt · 2017-07-26T18:26:43Z

Tell Story of the emergence of the need for a "data rescue" public record, and the creation of alpha.archivers.space

all the data rescue efforts happened
need a way to answer "what has been rescued so far?"
challenge: reconciling disparate metadata patterns -- original discussion in Compare the Metadata that Different Groups have about the Datasets They have Rescued dataset_registries#1
upshot: alpha.archivers.space > datatogether/public-record

flyingzumwalt · 2017-07-26T18:53:32Z

Contextualize this screenshot from https://alpha.archivers.space/primers/5b1031f4-38a8-40b3-be91-c324bf686a87 (image below), which does not accurately represent what has or hasn't been achieved by hackathon rescue efforts..

Only shows a piece of the puzzle -- the stuff manually downloaded by data rescuers at hackathons using archivers.space. Doesn't include stuff crawled by IA, which was seeded/nominated by hackathons but not downloaded there. Doesn't include the 30 PB of data and 100 TB of web content from Data Rescue Boulder. Doesn't include the stuff from Project Svalbard.

Need a view that shows all of that stuff together, from the frame of data vulnerability, access, and preservation. That's what the public record should show.

cc @shapironick

dcwalk · 2017-07-28T22:57:46Z

Just moving my comment from the #datatogether slack channel:

I feel like this screenshot is disengenuous of the work EDGI has done and some reservations about it being used to reflect archiving work:
0. this version of the system was never used at a DR event (it came online after all of ‘em!)

These aren’t the original primers (which are still all gdocs) that were used at DR events

that URL number doesn’t reflect anything flagged at DRs (@b5 correct me if I’m wrong), it actually is just the whole new archivers v2 crawlers)

doesn’t reflect what was seeded to IA

also isn’t mapped to the datarefuge.org site

“content” there as a concept doesn’t necessarily map to the way objects that were archived at events (it is more granular)

flyingzumwalt added the enhancement label Jul 26, 2017

Frijol added the stale label Oct 24, 2019

Frijol closed this as completed Oct 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[public record] Tell the background story of public record for "Rescued" data #3

[public record] Tell the background story of public record for "Rescued" data #3

flyingzumwalt commented Jul 26, 2017

flyingzumwalt commented Jul 26, 2017 •

edited

Loading

dcwalk commented Jul 28, 2017

[public record] Tell the background story of public record for "Rescued" data #3

[public record] Tell the background story of public record for "Rescued" data #3

Comments

flyingzumwalt commented Jul 26, 2017

flyingzumwalt commented Jul 26, 2017 • edited Loading

dcwalk commented Jul 28, 2017

flyingzumwalt commented Jul 26, 2017 •

edited

Loading