Releases: secutils-dev/secutils-web-scraper
Releases · secutils-dev/secutils-web-scraper
1.0.0-beta.1 (2024-05-20)
Refer to #secutils/v1.0.0-beta.1 for the full Secutils.dev release notes.
Features
- platform: support configurable
userAgent
header viaSECUTILS_WEB_SCRAPER_USER_AGENT
envvar (030c8d9)
Bug Fixes
- platform: set proper path to the
main
module inpackage.json
(47aeda2)
Full Changelog: v1.0.0-alpha.5...v1.0.0-beta.1
v1.0.0-alpha.5 (2024-01-10)
Refer to #secutils/v1.0.0-alpha.5 for the full Secutils.dev release notes.
Maintenance release (dependency upgrades and other chores).
Full Changelog: v1.0.0-alpha.4...v1.0.0-alpha.5
v1.0.0-alpha.4 (2023-12-26)
Refer to #secutils/v1.0.0-alpha.4 for the full Secutils.dev release notes.
Features
- web-page: add support for custom request HTTP headers (6a743ea)
- web-page: disable browser cache and selectively proxy requests to bypass CSP/CORS restrictions (6825861)
- web-page: disable CORS with
--disable-web-security
Chromium launch flag (f9507eb) - web-page: introduce new Web Content scraper API (a7d9de0)
Fixes
- web-page: bump web page
load
timeout from 5000ms to 10000ms (a9ee6ad) - web-page: serialize content state even if it has exceeded allowed size (7df4534)
- web-page: use only web page URL, headers, and user scripts to calculate cache key (bc163f5)
- web-page: use proper cryptographic hash for the response cache key (bec0919)
Enhancements
- web-page: use more stable Chrome DevTools Protocol to capture external resources (129ca5a)
- web-page: pretty print HTML content (cb613f4)
- web-page: use stable JSON stringifier to persist web page extracted content (5165a83)
Full Changelog: v1.0.0-alpha.3...v1.0.0-alpha.4