#

wayback-machine

Here are 64 public repositories matching this topic...

bitdruid / python-wayback-machine-downloader

Query and download archive.org as simple as possible.

scraper python3 wayback-machine osint-python archive-org wayback-downloader archive-downloader

Updated Jun 9, 2024
Python

Own-Data-Privateer / pwebarc

A suite of tools for mirroring and hoarding web pages you visit for later offline viewing. I.e. your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data, which also follows "archive everything now, figure out what to do with it later" philosophy.

backups internet self-hosted archive web-archiving wayback-machine internet-archiving

Updated Jun 7, 2024
Python

Barabazs / archivooor

Archivooor is a Python package for interacting with the archive.org API.

spn wayback-machine archive-org save-page-now spn2

Updated Jun 7, 2024
Python

ArchiveBox

ArchiveBox / ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Updated Jun 6, 2024
Python

joaohenggeler / eternal-wanderer

A bot that generates videos of old web pages and media files archived by the Wayback Machine to then publish them on Twitter, Mastodon, and Tumblr.

bot twitter-bot internet-archive wayback-machine tumblr-bot mastodon-bot

Updated Jun 6, 2024
Python

TheLovinator1 / FeedVault.se

FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.

rss backup archive internet-archive atom-feed rss-aggregator wayback-machine internet-archiving archivebox rss-archive feed-archive

Updated Jun 5, 2024
Python

claromes / waybacktweets

Archived tweets on Wayback Machine in an easy way

osint twitter internet-archive tweet wayback-machine socmint streamlit osint-tools

Updated Jun 8, 2024
Python

TiagoCavalcante / extract-folder

Automates the extraction of compressed files (which may not have the correct extension) within a folder

script unzip wayback-machine

Updated Apr 19, 2024
Python

4rnv / Scrappy

Script to scrap URLs from a webpage and archive them on the Wayback machine.

python scraper wayback-machine

Updated Apr 4, 2024
Python

pebnn / AutoInternetArchive

AutoInternetArchive is a very simple program designed to automatically archive webpages to The wayback machine with hourly intervals. AutoInternetArchive was designed to be run though a console window and left open for days or even months

python bot simple internet-archive automatic easy-to-use auto web-archiving wayback-machine wayback-archiver waybackmachine waybackpy autointernetarchive

Updated Mar 11, 2024
Python

waybackpy

akamhy / waybackpy

Wayback Machine API interface & a command-line tool

osint internet-archive web-archiving wayback-machine webarchiving cdx-api internet-archiving savepagenow archive-webpage archive-webpages wayback-machine-api wayback-machine-python

Updated Feb 26, 2024
Python

sangaline / wayback-machine-scraper

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

python web-scraping command-line-tool wayback-machine wayback-archiver archive-dot-org

Updated Feb 23, 2024
Python

capture-urls

rybesh / capture-urls

Archive a list of URLs using the Wayback Machine

web-archiving wayback-machine save-page-now

Updated Feb 21, 2024
Python

Luraminaki / WaybackMachineDownloaderCompanion

Tool (Scripts that complement the hartator/wayback-machine-downloader software output) in Python-3

python3 wayback-machine

Updated Feb 20, 2024
Python

sangaline / scrapy-wayback-machine

A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

python middleware web-scraping scrapy wayback-machine scrapy-extension archive-dot-org

Updated Feb 18, 2024
Python

bellingcat / wayback-google-analytics

A lightweight tool for scraping current and historic Google Analytics data

python scraper command-line google-analytics wayback-machine open-source-research

Updated Feb 16, 2024
Python

DEMON1A / Discord-Recon

Discord bot created to automate bug bounty recon, automated scans and information gathering via a discord server

automation discord hacking python3 recon nuclei bugbounty wayback-machine reconnaissance hackingtools bugbounty-tool discord-recon

Updated Jan 8, 2024
Python

Fooftilly / RSS_archiver

Download and archive RSS feeds to Wayback Machine. Save a list of archived feed in locad db.

rss archive internet-archive rss-feed archiver wayback-machine webarchive link-archiver internet-archiving rss-archive link-archive

Updated Oct 19, 2023
Python

agude / wayback-machine-archiver

A Python script to submit web pages to the Wayback Machine for archiving.

sitemap backup python3 internet-archive wayback-machine wayback-archiver

Updated Oct 8, 2023
Python

KarimPwnz / waybacked

Get URLs from the Wayback Machine. Able to handle large outputs.

security wayback-machine reconnaissance

Updated Sep 15, 2023
Python

Improve this page

Add a description, image, and links to the wayback-machine topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wayback-machine topic, visit your repo's landing page and select "manage topics."