Core Python Web Archiving Toolkit for replay and recording of web archives
-
Updated
Nov 5, 2024 - JavaScript
Core Python Web Archiving Toolkit for replay and recording of web archives
Streaming WARC/ARC library for fast web archive IO
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
Browse emulated browsers connected to old web sites in your browser!
Parse And Create Web ARChive (WARC) files with node.js
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
A Python utility for publishing a social media story built from archived web pages to multiple services.
Create "perfect" snapshots of web pages
Add-On for Google Sheets to help those working with web archives.
This module builds our Waybacks in the various different configurations we require.
Create Robust Links from within Zotero
A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).
Parse CDXJ(https://github.com/oduwsdl/ORS/wiki/CDXJ) files with node.js
Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
A collection of the scripts and notebooks I wrote as part of my Data Science Bootcamp capstone project
Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX
宁波凯思奥教育科技有限公司
PalaceRadio | A Next.js app Built from web Archive | Freelance Project @upwork
Add a description, image, and links to the web-archives topic page so that developers can more easily learn about it.
To associate your repository with the web-archives topic, visit your repo's landing page and select "manage topics."