riplink
finds dead links on the web. It's useful for double-checking web pages for incorrect, or dead web links.
Inspired by Wikimedia and the Internet Archive fixing broken links on Wikipedia.
$ go get github.com/mschwager/riplink
By default riplink
will only print links to pages that return something other than an HTTP 2XX
.
If we specify the -verbose
flag it will print all links:
$ riplink -url https://google.com -verbose
https://www.google.com/intl/en/options/ 200
https://www.google.com/imghp?hl=en&tab=wi 200
https://google.com/intl/en/policies/terms/ 200
https://google.com/intl/en/ads/ 200
https://google.com/services/ 200
...
The -depth
flag can be used to recurse into discovered links:
# Follow links up to 3 pages deep
$ riplink -url https://google.com -depth 3
...
The -same-domain
flag can be used to avoid querying links from other domains:
# Avoid links that aren't on google.com
$ riplink -url https://google.com -same-domain
...
If you're looking for specific HTTP return codes you can use the -http-code
flag:
# Only output links that return HTTP 302
$ riplink -url https://google.com -http-code 302
...
$ go test ./...