Crawl RSS - Heritrix 3 add-on
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
src
.gitignore
.travis.yml
LICENSE.txt
README.md
pom.xml

README.md

Crawl RSS - Heritrix 3 add-on

Build Status

NOTE: This add-on will only work with Heritrix 3.3.0 or later.

Installation

  1. Download the code

  2. Run "mvn package". This generates a distribution tar.gz file.

  3. Extract the archive from step #2 into the root directory of a Heritrix (3.3.0+) instance

  4. Startup Heritrix as usual

  5. Base your job on the supplied profile "CrawlRSS-Sample-Profile"