Xenu Link Checker

Xenu Link Checker checks the links discovered by Xenu Link Sleuth. The purpose of this tool is to gain visibility into broken links on "v2" (or later) website launches, so that you can know ahead of time whether you have completed a 1-to-1 mapping of old site urls to new site urls.

This tool operates by reading the .txt (TSV) export of Xenu Link Sleuth crawl and checks each link with some concurrency.

Features:

Follow redirects to see if they result in an OK response.
Check the contents of the page for a regex and produce a warning.
Run the link checker from a CD pipeline or other automation system to monitor progress in implementing all old links.
Filter links from the .txt TSV file by JMESPath expression.
Transpose the domain part of all links to another domain, such as a dev or test site.
Checks are issued with configurable levels of concurrency
Emits a JSON file which you can futher-process in your CD pipeline

Filter Expressions

The --filter option allows you to filter by field in the TSV file that Xenu Outputs. Here are some fields available in the file:

Field	Description/Example
`Address`	`https://www.iana.org/`
`Status-Code`	`200`
`Type`	`text/html`, `image/gif`, `application/javascript`
`Size`	`6735` (bytes)
`Title`	`Internet Assigned Numbers Authority`
`Links In`	`42`
`Links Out`	`0`

Example Usage

Check text/html links from mywebsite-com.txt for the presence of the regex /_next/ in the content

npx -p @wheatstalk/xenu-checker xenu-checker \
  check \
  --filter "Type == 'text/html'" \
  --transpose-domain https://test.mywebsite.com \
  --check-regex _next \
  --concurrency 100 \
  mywebsite-com.txt

Name		Name	Last commit message	Last commit date
Latest commit History 368 Commits
.github		.github
.projen		.projen
bin		bin
src		src
test		test
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.mergify.yml		.mergify.yml
.npmignore		.npmignore
.projenrc.js		.projenrc.js
.versionrc.json		.versionrc.json
API.md		API.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json
tsconfig.jest.json		tsconfig.jest.json
tsconfig.json		tsconfig.json
version.json		version.json
yarn.lock		yarn.lock

License

wheatstalk/xenu-checker

Folders and files

Latest commit

History

Repository files navigation

Xenu Link Checker

Filter Expressions

Example Usage

About

Resources

License

Stars

Watchers

Forks

Languages