Skip to content
This repository has been archived by the owner on Apr 4, 2018. It is now read-only.

alphagov/bad_link_finder

Repository files navigation

Bad link finder

Crawls a mirrored site and checks that all links either return a successful response or redirect to somewhere that does.

Usage

Set environment variables:

  • MIRROR_DIR, to the location of your mirrored site. Local files will be read in alphabetical order.
  • REPORT_OUTPUT_FILE, to the location you'd like the CSV saved to.
  • SITE_HOST, to the full host of the live site you'd like to test against, including protocol. For example, https://www.example.com
  • START_FROM, optional, specify a file on disk from which to start, e.g. example-folder/example.html. Will check this file and all following files. Useful for restarting aborted runs.

Then execute bad_link_finder.