Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog and this project adheres to Semantic Versioning.

[Unreleased]

Changed

Fixed

  • Fix crawler.response returning null when connecting to a specific Chrome instance #354.
  • Fix crawler failing to follow URLs that contain # hashes #332.
  • Fix crawler pending indefinitely when mixed content is present #260.

Security

  • Fix: 🔒 high-severity lodash vulnerability #339.
  • Fix: 🔒 update jquery and lodash to fix Prototype Pollution vulnerability.
  • Fix: 🔒 update puppeteer to fix Use After Free vulnerability #350.
  • Fix: 🔒 update jQuery to fix XSS vulnerability.

[1.8.0] - 2018-06-11

Added

Changed

  • Drop newpage event.
  • Update Puppeteer version to 1.5.0.

Fixed

  • Fix a bug of not marking skipped requests correctly.
  • Fix requestfinished event's argument as described in the API reference.

[1.7.0] - 2018-05-14

Added

  • Support cookies for crawler.queue()'s options.
  • Make onSuccess pass cookies in the response.

Changed

[1.6.0] - 2018-04-21

Added

  • Support viewport and skipRequestedRedirect for crawler.queue()'s options.
  • Emit requestdisallowed event.
  • Make onSuccess pass redirectChain in the response.

Changed

Fixed

  • Fix a bug of silently failing to insert jQuery due to CSP.

[1.5.0] - 2018-03-25

Added

Fixed

  • Fix a bug where the timeout option could not be set per request.
  • Fix a bug of crawling the same page twice when one URL has a trailing slash on the root folder and the other does not.
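
The trailing-slash fix above concerns treating https://example.com and https://example.com/ as the same page. A minimal sketch of that idea follows, assuming URLs are normalized with the standard WHATWG URL class before deduplication. The names seen and shouldCrawl are hypothetical and are not taken from the crawler's source.

```javascript
// Hypothetical sketch, not the library's implementation.
// The WHATWG URL parser gives an http(s) URL without a path the path "/",
// so both spellings of the root URL serialize to the same href.
const seen = new Set();

function shouldCrawl(rawUrl) {
  const key = new URL(rawUrl).href; // normalized form used as the dedupe key
  if (seen.has(key)) return false;  // already requested, skip it
  seen.add(key);
  return true;
}
```

Note that this only unifies the root folder: /a and /a/ remain distinct keys, which matches the scope described in the entry.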

[1.4.0] - 2018-02-24

Added

  • Support browserCache for crawler.queue()'s options.
  • Support depthPriority option again.

[1.3.4] - 2018-02-22

Changed

[1.3.3] - 2018-02-21

Added

  • Emit newpage event.
  • Support deniedDomains and depthPriority for crawler.queue()'s options.

Changed

  • Allow allowedDomains option to accept a list of regular expressions.
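
To illustrate what accepting regular expressions in allowedDomains could look like, here is a small sketch under stated assumptions. The helper isAllowedDomain is hypothetical, and the suffix-matching rule for plain strings is an assumption rather than documented behavior.

```javascript
// Hypothetical sketch, not the library's implementation.
// Each entry is either a string (assumed here to match the hostname itself
// or any of its subdomains) or a RegExp tested against the hostname.
function isAllowedDomain(hostname, allowedDomains) {
  return allowedDomains.some((domain) => {
    if (domain instanceof RegExp) return domain.test(hostname);
    return hostname === domain || hostname.endsWith(`.${domain}`);
  });
}
```

With this sketch, a list such as ['example.com', /\.example\.org$/] would admit www.example.com and api.example.org but reject example.net.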

[1.3.2] - 2018-01-19

Added

Fixed

  • Fix a bug of not showing console message properly.

[1.3.1] - 2018-01-14

Fixed

  • Fix a bug of listing response properties as methods.
  • Fix a bug of not obeying robots.txt.

[1.3.0] - 2018-01-12

Added

Changed

[1.2.5] - 2018-01-03

Added

Changed

  • Make the cache option required for HCCrawler.connect() and HCCrawler.launch().
  • Provide skipDuplicates to remember and skip duplicate URLs, instead of passing null to cache option.
  • Modify BaseCache interface.

[1.2.4] - 2017-12-25

Added

  • Support CSV and JSON Lines formats for exporting results.
  • Emit requeststarted, requestskipped, requestfinished, requestfailed, maxdepthreached, maxrequestreached and disconnected events.
  • Improve debug logs by tracing public APIs and events.

Changed

  • Allow onSuccess and evaluatePage options as null.
  • Change crawler.isPaused, crawler.queueSize, crawler.pendingQueueSize and crawler.requestedCount from read-only properties to methods.

Fixed

  • Fix a bug of ignoring maxDepth option.

[1.2.3] - 2017-12-17

Changed

  • Refactor by changing the style of requiring the cache directory.

Fixed

  • Fix a bug of starting more crawlers than maxConcurrency allows when requests fail.

[1.2.2] - 2017-12-16

Added

  • Automatically collect and follow links found in the requested page.
  • Support maxDepth for crawler.queue()'s options.

[1.2.1] - 2017-12-13

Added

[1.2.0] - 2017-12-11

Changed

[1.1.2] - 2017-12-10

Added

[1.1.1] - 2017-12-09

Added

Changed

  • Automatically dismiss dialogs.
  • Improve performance by setting up pages in parallel.

[1.1.0] - 2017-12-08

Added

Changed

  • The public API for launching a browser has changed: launch the browser with HCCrawler.launch().
  • Rename shouldRequest to preRequest for crawler.queue()'s options.
  • Refactor by separating HCCrawler and Crawler classes.
  • Refactor handlers for options.

[1.0.0] - 2017-12-05

Added

Changed

  • Migrate from NPM to Yarn.
  • Refactor helper to class static method style.