Web archiving software suite.
Work in progress. The code, functionality, and documentation is incomplete.
What's mostly implemented:
- WARC files
- List contents, dump & load as JSON, and extract files
- API: read and write WARC files
The following is planned:
- Traditional web crawler & archiver
- MITM proxy server capture
- Browser-based capture
- Alternative archive file format
Downloads can be found on the Releases section.
If you want to compile the application yourself, you can do so using cargo from the webaves-app
crate.
For information on how to use the application, see the user guide. If you need help, please check the Discussions section.
The components of Webaves can be reused in your own Rust projects from the webaves
crate.
See Contributing for information about bug reports and contributing to the project.
Copyright 2022 Christopher Foo. Licensed under Mozilla Public License Version 2.0.