Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Updated May 19, 2020 - JavaScript
Parse and create Web ARChive (WARC) files with Node.js
Quick Cache and Archive search buttons
A social media open post web archiving tool
An archival thumbnail visualization server
News Archiver, Data Aggregation for CNN and Fox News
Record the currently active tab on webrecorder.io
Client app for the httpreserve package that generates CSV, JSON, HTTP, and BoltDB output
Parse CDXJ (https://github.com/oduwsdl/ORS/wiki/CDXJ) files with Node.js