Skip to content

When taking a snapshot Heritrix renames crawl.log

Alex Osborne edited this page Jul 4, 2018 · 2 revisions

When taking a snapshot of a crawl, Heritrix will rename the crawl.log file.  For example, the default file 'crawl.log' will be named crawl.logXXX where XXX is a combination of a sequential id and a timestamp.  This may cause issues if you are, for example, using a script to monitor log entries by explicitly referencing  'crawl.log'.

Heritrix

Structured Guides:

Wiki index

FAQs

User Guide

Knowledge Base

Known Issues

Background Reading

Users of Heritrix

How To Crawl

Development

Clone this wiki locally