#
web-archive
Here are 10 public repositories matching this topic...
Crawls the web to generate a huge dataset for training
-
Updated
Jan 24, 2024 - Python
Miscellaneous utility scripts
-
Updated
Dec 14, 2023 - Python
Hunt down the secrets from the WebArchives for Fun and Profit
osint
bughunting
security-tools
web-archive
subdomain-scanner
subdomain-enumeration
email-enumeration
-
Updated
Dec 8, 2022 - Python
A mirror of The Huddle magazine
-
Updated
Aug 10, 2022 - Python
Summarize web archive capture index (CDX) files.
-
Updated
Jul 29, 2022 - Python
A Tool to Summarize Web Archive Holdings
-
Updated
Jun 15, 2021 - Python
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
python
rss
firefox
backup
browser
pinboard
safari
google-chrome
bookmarks
chromium
wget
pocket
archive
web-browser
web-archiving
preservation
headless-chrome
web-archive
html-export
headless-browser
-
Updated
Aug 12, 2018 - Python
Improve this page
Add a description, image, and links to the web-archive topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the web-archive topic, visit your repo's landing page and select "manage topics."