Releases: WebCuratorTool/webcurator

v3.2.1

28 Aug 00:56

Release notes:

Pywb Individual Collections feature

This feature adds the option of using individual collections with Pywb, which ensures that only resources crawled within a harvest are loaded during replay.

  • Collections are determined at the Target Instance and Harvest Result level.
  • Configured within the Store properties. See the documentation for options.
  • Includes an option to use a symbolic link to a Harvest Result directory, which can reduce storage use and avoid extra file copying.
  • The pywbIndexer.enable property has been renamed to pywbIndexer.enabled.
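
For existing deployments, the property rename in the last item means the old key needs updating in the Store configuration. A minimal sketch of that change, assuming the property takes a boolean value and is set in the Store's properties file (neither detail is stated in these notes):

```properties
# Before v3.2.1 (old key, no longer used)
# pywbIndexer.enable=true

# From v3.2.1 (renamed key). The value "true" is an illustrative assumption.
pywbIndexer.enabled=true
```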

v3.2.0

29 Apr 02:19

Release notes:

  • Harvest Screenshots feature
  • Minor enhancements
    • Rosetta DPS.
    • Harvest visualization UI performance.
  • Bug fixes (#101, #65)

v3.1.4

20 Apr 23:24

Release notes:

  • Harvest Visualization and Analysis
    • Optimizations for loading and performance.
    • Tree views now allow operations on selected URLs and their children (fixes #76).
    • Berkeley DB upgraded to the latest version.
    • Browse context menu now has a link to the Access Tool.
    • The redundant HarvestResourceDTO class has been removed.
  • CDX format now configurable.
  • Default CDX now generated with a non-SURT formatted URL in the N-field (fixes #87)
  • Fix added for Harvest Agents that fail to transfer harvests completely, which left the Target Instance stuck in the Stopping state.

v3.1.3

14 Dec 15:05

Release notes:

  • Update CDX format and allow users to specify an alternative format (fixes #70).
  • Enable the use of soft links (to WARC files in the store) in the QA wayback input directory (implements #72).
  • Fix issue in block URLs regex in profiles and profile overrides (fixes #79).

v3.1.2

23 Aug 22:25

v3.1.2 - patch version release

Enhancements/Fixes:

  • PR #59 - Fixes issue with non-alphanumeric characters in target names.
  • PR #62 - Remove generation of sorted crawl logs (#52).
  • PR #64 - Remove redundant interfaces.
  • PR #68 - Fix for CVE-2022-22965.

v3.1.1

19 Jul 21:45

v3.1.1 - patch version release

Enhancements:

  • Support SFTP in Submit-to-Rosetta module.
  • Integration with Rosetta 7.1 SDK and API in Submit-to-Rosetta module.

v3.1.0

15 Jul 03:43

v3.1.0 - minor version release

See release notes for details on changes

Binary package contents:

db/
database setup and upgrade scripts.

lib/
webcurator-webapp-3.0.3.war
webcurator-store-3.0.3.war
webcurator-harvest-agent-h3-3.0.3.jar
webcurator-harvest-agent-h1-3.0.3.jar

All WAR/JAR files are compiled with JDK 8

v3.0.3

04 May 01:44

v3.0.3 - patch version release

Bugs fixed:

  • Documentation typos fixed.
  • H3 Harvest Agent heartbeats fail when started before Webapp
  • ‘Too many files open’ error when Store is transferring files

Binary package contents:

db/
database setup and upgrade scripts.

lib/

  • webcurator-webapp-3.0.3.war
  • webcurator-store-3.0.3.war
  • webcurator-harvest-agent-h3-3.0.3.jar
  • webcurator-harvest-agent-h1-3.0.3.jar

All WAR/JAR files are compiled with JDK 8

v3.0.2

08 Apr 01:46

v3.0.2 - patch version release

Bugs fixed:

  • Duplicate crawl running on multiple harvest agents
  • Occasional failure to archive using the Submit-to-Rosetta plugin
  • Escape special characters in Target Annotation field
  • Old bandwidth restrictions skipped for H3
  • Better handling of heartbeat failed issues
  • Extra profile validation for blocked and included URL regexes
  • Hard-coded version number on login page
  • Fix for slow deletion of harvest resources in MySQL/MariaDB

v3.0.0

03 Dec 08:49

v3.0.0 - major version release

See release notes for details on changes