Only shows a piece of the puzzle -- the stuff manually downloaded by data rescuers at hackathons using archivers.space. Doesn't include stuff crawled by IA, which was seeded/nominated by hackathons but not downloaded there. Doesn't include the 30 PB of data and 100 TB of web content from Data Rescue Boulder. Doesn't include the stuff from Project Svalbard.
Need a view that shows all of that stuff together, from the frame of data vulnerability, access, and preservation. That's what the public record should show.
Just moving my comment from the #datatogether slack channel:
I feel like this screenshot is disingenuous about the work EDGI has done, and I have some reservations about it being used to reflect archiving work:
0. this version of the system was never used at a DR event (it came online after all of ‘em!)
1. these aren’t the original primers (which are still all gdocs) that were used at DR events
2. that URL number doesn’t reflect anything flagged at DRs (@b5 correct me if I’m wrong); it is actually just the whole output of the new archivers v2 crawlers
3. it doesn’t reflect what was seeded to IA
4. it also isn’t mapped to the datarefuge.org site
5. “content” there as a concept doesn’t necessarily map to the way objects were archived at events (it is more granular)
Tell the story of the emergence of the need for a "data rescue" public record, and the creation of alpha.archivers.space