This repository contains technical specifications used by the Webrecorder project for building interoperable web archiving tools. Please help us adapt and implement them to ensure that web archives are easier to use, trust, and share on the web.
- Use Cases for Decentralized Web Archives: a summary of requirements and potential threat models for distributed web archives
- Web Archive Collection Zipped (WACZ): a packaging standard for web archives on the web
- WACZ Signing and Verification: the mechanics for signing and verifying WACZ files for proof of authenticity
- Crawl Index JSON (CDXJ): an extensible format for WARC index files
Please help us adapt and implement these community standards to help ensure that web archives are easier to use, trust, and share on the web.
We are using Git and GitHub to manage the versioning of these specifications, and also the GitHub Issue Tracker to track ongoing work. If you have questions or suggestions for the specifications, or need implementation guidance please use the issue tracker!