Releases: secutils-dev/secutils-web-scraper

v1.0.0-beta.1 (2024-05-20)

20 May 09:59
Pre-release

Refer to #secutils/v1.0.0-beta.1 for the full Secutils.dev release notes.

Features

  • platform: support a configurable User-Agent header via the SECUTILS_WEB_SCRAPER_USER_AGENT environment variable (030c8d9)
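
As a rough illustration of how an environment-driven User-Agent override could be resolved, here is a minimal TypeScript sketch. Only the variable name SECUTILS_WEB_SCRAPER_USER_AGENT comes from the note above; the function name, the `Env` type, and the fallback value are hypothetical and not taken from the project.

```typescript
// Hypothetical fallback value: these notes do not state the scraper's real default.
const DEFAULT_USER_AGENT = 'example-scraper/0.0.0';

// A plain record so the helper can be exercised without touching the real process environment.
type Env = Record<string, string | undefined>;

// Returns the trimmed override if one is set and non-empty, otherwise the fallback.
function resolveUserAgent(env: Env): string {
  const value = env['SECUTILS_WEB_SCRAPER_USER_AGENT']?.trim();
  return value ? value : DEFAULT_USER_AGENT;
}
```

In a Node.js service this helper would typically be called once at startup with `process.env`, and the result passed to whatever browser or HTTP client issues the requests.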

Bug Fixes

  • platform: set proper path to the main module in package.json (47aeda2)

Full Changelog: v1.0.0-alpha.5...v1.0.0-beta.1

v1.0.0-alpha.5 (2024-01-10)

10 Jan 16:11
Pre-release

Refer to #secutils/v1.0.0-alpha.5 for the full Secutils.dev release notes.

Maintenance release (dependency upgrades and other chores).

Full Changelog: v1.0.0-alpha.4...v1.0.0-alpha.5

v1.0.0-alpha.4 (2023-12-26)

26 Dec 20:05
Pre-release

Refer to #secutils/v1.0.0-alpha.4 for the full Secutils.dev release notes.

Features

  • web-page: add support for custom request HTTP headers (6a743ea)
  • web-page: disable browser cache and selectively proxy requests to bypass CSP/CORS restrictions (6825861)
  • web-page: disable CORS with --disable-web-security Chromium launch flag (f9507eb)
  • web-page: introduce new Web Content scraper API (a7d9de0)

Fixes

  • web-page: bump web page load timeout from 5000ms to 10000ms (a9ee6ad)
  • web-page: serialize content state even if it exceeds the allowed size (7df4534)
  • web-page: use only web page URL, headers, and user scripts to calculate cache key (bc163f5)
  • web-page: use proper cryptographic hash for the response cache key (bec0919)
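
The two cache-key fixes above can be pictured with a small sketch: derive the key only from the page URL, the request headers, and the user scripts, then hash the canonical form with a cryptographic digest. This is an assumption-laden illustration, not the project's code: the `CacheKeyInput` shape, the function names, and the choice of SHA-256 are all hypothetical (the notes only say "proper cryptographic hash").

```typescript
import { createHash } from 'node:crypto';

// Hypothetical input shape: only the three ingredients named in the release notes.
interface CacheKeyInput {
  url: string;
  headers?: Record<string, string>;
  scripts?: Record<string, string>;
}

// Sort entries by name so that property order does not change the key.
function sortedEntries(record: Record<string, string> = {}): [string, string][] {
  return Object.entries(record).sort(([a], [b]) => (a < b ? -1 : a > b ? 1 : 0));
}

// SHA-256 is an assumption; the notes do not name the hash function used.
function cacheKey(input: CacheKeyInput): string {
  const canonical = JSON.stringify([
    input.url,
    sortedEntries(input.headers),
    sortedEntries(input.scripts),
  ]);
  return createHash('sha256').update(canonical).digest('hex');
}
```

Because anything outside those three inputs (timeouts, delays, and similar options) is left out of the canonical form, two requests that differ only in such options would share a cache entry in this sketch. Header names are compared as written here; a real implementation might lowercase them first.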

Enhancements

  • web-page: use more stable Chrome DevTools Protocol to capture external resources (129ca5a)
  • web-page: pretty print HTML content (cb613f4)
  • web-page: use stable JSON stringifier to persist web page extracted content (5165a83)
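
For readers unfamiliar with the term, a "stable" JSON stringifier emits object keys in a deterministic order, so that equal content always serializes to the same string and can be compared or diffed reliably. The sketch below shows one common way to get that behaviour with the built-in `JSON.stringify` replacer; the function name is hypothetical, and the project may well use a library for this instead.

```typescript
type Json = null | boolean | number | string | Json[] | { [key: string]: Json };

// Serializes a value with object keys sorted at every nesting level.
// Array element order is preserved, since it is part of the data.
function stableStringify(value: Json, indent?: number): string {
  return JSON.stringify(
    value,
    (_key, val) => {
      if (val !== null && typeof val === 'object' && !Array.isArray(val)) {
        // Rebuild plain objects with keys inserted in sorted order.
        const sorted: Record<string, unknown> = {};
        for (const k of Object.keys(val).sort()) {
          sorted[k] = val[k];
        }
        return sorted;
      }
      return val;
    },
    indent,
  );
}

// Two objects with the same content but different key order serialize identically.
const first = stableStringify({ b: 1, a: { d: 2, c: 3 } });
const second = stableStringify({ a: { c: 3, d: 2 }, b: 1 });
console.log(first === second); // prints true
```

One caveat of this approach: JavaScript always enumerates integer-like keys (such as "2" and "10") in numeric order ahead of other keys, so the output is deterministic but not strictly lexicographic for such keys.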

Full Changelog: v1.0.0-alpha.3...v1.0.0-alpha.4