Query and download from archive.org as simply as possible.
Updated Jun 9, 2024 - Python
A suite of tools for mirroring and hoarding web pages you visit for later offline viewing. In other words, your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data, following the "archive everything now, figure out what to do with it later" philosophy.
Archivooor is a Python package for interacting with the archive.org API.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
A bot that generates videos of old web pages and media files archived by the Wayback Machine to then publish them on Twitter, Mastodon, and Tumblr.
FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.
Archived tweets on the Wayback Machine, the easy way.
Automates the extraction of compressed files (which may not have the correct extension) within a folder
Script to scrape URLs from a webpage and archive them on the Wayback Machine.
AutoInternetArchive is a very simple program designed to automatically archive webpages to the Wayback Machine at hourly intervals. AutoInternetArchive was designed to be run through a console window and left open for days or even months.
Wayback Machine API interface & a command-line tool
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Archive a list of URLs using the Wayback Machine
Python 3 scripts that complement the output of the hartator/wayback-machine-downloader software.
A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
A lightweight tool for scraping current and historic Google Analytics data
Discord bot created to automate bug bounty recon, automated scans, and information gathering via a Discord server.
Download and archive RSS feeds to the Wayback Machine. Saves a list of archived feeds in a local database.
A Python script to submit web pages to the Wayback Machine for archiving.
Get URLs from the Wayback Machine. Able to handle large outputs.
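Several of the tools above do one of two things: submit a page to the Wayback Machine, or list the captures it already holds. As a rough illustration only (not how any listed project is implemented), the sketch below builds the request URLs for the public Save Page Now and CDX endpoints and converts a CDX JSON response into replay URLs. The function names `save_url`, `cdx_query_url`, and `snapshot_urls` are hypothetical, and the code makes no network requests.

```python
from urllib.parse import quote, urlencode

# Public Wayback Machine endpoints; everything else here is illustrative.
SAVE_ENDPOINT = "https://web.archive.org/save/"
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def save_url(page_url: str) -> str:
    """Build the Save Page Now URL that requests a capture of page_url."""
    # Keep common URL delimiters unescaped so the target stays readable.
    return SAVE_ENDPOINT + quote(page_url, safe=":/?&=%")


def cdx_query_url(page_url: str, limit: int = 10) -> str:
    """Build a CDX query URL that lists captures of page_url as JSON."""
    params = {"url": page_url, "output": "json", "limit": limit}
    return CDX_ENDPOINT + "?" + urlencode(params)


def snapshot_urls(cdx_rows: list[list[str]]) -> list[str]:
    """Turn CDX JSON rows (header row first) into replay URLs."""
    if not cdx_rows:
        return []
    header, *rows = cdx_rows
    ts = header.index("timestamp")
    orig = header.index("original")
    return [f"https://web.archive.org/web/{row[ts]}/{row[orig]}" for row in rows]
```

In practice the URLs would be fetched with an HTTP client of your choice, with rate limiting and retries, since both endpoints may throttle heavy use.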