Hunt down the secrets from the WebArchives for Fun and Profit
-
Updated
Dec 8, 2022 - Python
Hunt down the secrets from the WebArchives for Fun and Profit
Summarize web archive capture index (CDX) files.
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
A Tool to Summarize Web Archive Holdings
A mirror of The Huddle magazine
Miscellaneous utility scripts
Crawls the web to generate a huge dataset for training
Add a description, image, and links to the web-archive topic page so that developers can more easily learn about it.
To associate your repository with the web-archive topic, visit your repo's landing page and select "manage topics."