The Web Curator Tool is a tool for managing the selective web harvesting process. (moved from SourceForge). https://webcurator.slack.com
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
harvest-agent-h1
harvest-agent-h3 D1.4.6.1: Fixed bug where heartbeat was killing further validation jobs. Aug 20, 2018
wct-assembly
wct-core
wct-store
wct-submit-to-rosetta
.gitignore
.travis.yml 14: enabled tests in .travis.yml (and disabled failing jdk7 profile) Jan 19, 2018
LICENSE
development.txt
install_maven_dependencies.bat
install_maven_dependencies.sh
pom.xml
readme.md readme updated for v1.7.0 Jan 19, 2018
upgrade_config.txt

readme.md

Build Status

WCT 1.7.0 Beta

This is the WCT 1.7.0 Beta version.

Before installing

Please ensure the user that WCT uses to login to your database has the correct permissions to create temporary tables. Failure to grant this will result in problems during the purge process.

WCT changes for v1.7.0 Beta


Heritrix 3

This version of WCT is the first step in moving towards Heritrix 3.x integration. It requires a separate standalone instance of Heritrix 3.x.

While this version is marked as Beta, the National Library of New Zealand is currently using it in Production due to it's urgent need for Heritrix 3 capability. That being said, some caution is advised as extensive testing has not been finished on this version and is still some way from a functionally ideal and user friendly integration with Heritrix 3.x.

The changes in this version are located in the Harvest Agent module. Please see the readme file under webcurator/wct-harvest-agent/readme.md for further setup details and notes.