Skip to content

Releases: webrecorder/archiveweb.page

ArchiveWeb.page App 0.6.12

20 Aug 20:15
eddd859
Compare
Choose a tag to compare

Fixes include:

  • Optimization: Better handling of large HTML (don't serialize to string)
  • Fix storing POST requests with original POST data
  • Don't convert POST to get when replaying POST requests (via wabac.js 2.8.0-beta.0)
  • Fidelity improvements via (wabac.js 2.8.0-beta.0)

ArchiveWeb.page App 0.6.11

06 Aug 23:55
f6f1dfe
Compare
Choose a tag to compare

Support for Apple M1 chip macs with universal binary for OSX.
Downloader: WACZ downloads store WARCs with .warc.gz instead of .warc extension to avoid confusion.

ArchiveWeb.page App 0.6.10

23 Jul 17:01
Compare
Choose a tag to compare

Various bug fixes:

  • Fidelity improvements for replay
  • Improved async fetching (first try directly, if errors, try via autofetch behavior in frame)
  • Update to latest behaviors, fix for twitter timeline photo iteration.
  • Update to latest Ruffle

ArchiveWeb.page App 0.6.8

06 Jun 06:18
Compare
Choose a tag to compare
  • Fixes IPFS sharing:
  • update to latest IPFS version (0.7.1)
  • waiting for IPFS sharing to complete
  • Support for recording window.devicePixelRatio
  • Fidelity improvements with wabac.js 2.7.8

ArchiveWeb.page App 0.6.7

19 May 01:50
Compare
Choose a tag to compare

Fixes pages opened in new windows or downloads sometimes not being archived due to CORS issues.

Replay improvements in sync with ReplayWeb.page 1.4.3

ArchiveWeb.page App 0.6.6

10 May 23:47
Compare
Choose a tag to compare

Changes include:

  • Bug fix: Fix for crash when recording certain pages with service workers (#33)
  • WACZ: Use SURT-ordered CDX again, properly sorted.
  • WARC: use sha256 prefix instead of sha-256:
  • Autopilot Behaviors: Update to latest browsertrix behaviors (0.2.1), includes fix for facebook behavior
  • Dependencies: Update to latest ruffle.js, warcio.js (1.4.5), wabac.js (2.7.3)

ArchiveWeb.page App 0.6.4

26 Apr 22:35
4c748fb
Compare
Choose a tag to compare

Fidelity Improvements:

  • Fixed replay of some Tableau dashboards (multipart/form-data replayed correctly)
  • PDF: fixed capture and text extraction of PDFs
  • Downloads: fixed capture of file downloads that were previously reported as failed.

Downloading WARC/WACZ improvements:

  • Uses new WACZ 1.1 format
  • Uses SHA-256 instead of MD5
  • WARC files are deterministic on every download (unless changed), using uuiv5 for consistent record ids.
  • Includes hashes for all WARC records and compressed cdx blocks
  • Includes hash of datapackage.json and signature in datapackage-digest.json

ArchiveWeb.page App 0.6.3

22 Apr 16:56
Compare
Choose a tag to compare

Fidelity: Capture Improvements for PUT/POST requests, support longer URLs (fixes replay of Yahoo Answers)
Optimizations: Update page and total sizes periodically instead of after every request, better performance with multiple tabs/windows running behaviors.

ArchiveWeb.page App 0.6.2

21 Apr 18:31
Compare
Choose a tag to compare

Behavior Improvements:

  • Fix start/pause in App
  • Faster start time (only waiting for pending network requests, not internal async fetches).
  • Add experimental Facebook behavior!
  • Fix Behaviors only working on English sites (Instagram, Twitter)

UI:

  • fix page selection, fixes #25
  • show 'Loading Archives...' until first API response succeeds.
  • add message when finishing recording, wait for requests to finish (but not more than 20 seconds)
  • fix for location bar not always loading entered URL (from replayweb.page)

IPFS:

  • use trickle dag
  • link default to replayweb.page link as gateway links not yet reliable for partial loading

ArchiveWeb.page App 0.6.0

12 Apr 16:05
3bc4260
Compare
Choose a tag to compare

Initial support for Autopilot behaviors!

The supported behaviors included Instagram, Twitter and generic autoscroll (from browsertrix-behaviors)

Behaviors are still new and in process of being tested.

Note: Currently, the Twitter behavior only works on English localizations of Twitter, as it relies on text to detect the structure of the page.