This project checks the HeroArts website for two categories of errors.
One part scans the HeroArts sitemap, looking for duplicate products, pages, collections, and blogs.
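A minimal sketch of that duplicate check is below. It assumes a Shopify-style sitemap index at https://heroarts.com/sitemap.xml (the exact URL and layout should be confirmed against the live site) and simply flags any URL that appears more than once across the child sitemaps:

```python
import requests
import xml.etree.ElementTree as ET
from collections import Counter

# Assumed Shopify-style sitemap index; confirm the real URL on the live site.
SITEMAP_INDEX = "https://heroarts.com/sitemap.xml"
SM = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

def locs(sitemap_url):
    """Return every <loc> URL listed in a sitemap (index or child sitemap)."""
    root = ET.fromstring(requests.get(sitemap_url, timeout=30).content)
    return [el.text.strip() for el in root.iter(f"{SM}loc")]

def find_duplicates():
    """Gather the URLs from every child sitemap (products, pages,
    collections, blogs) and report any that appear more than once."""
    all_urls = []
    for child in locs(SITEMAP_INDEX):
        all_urls.extend(locs(child))
    for url, count in Counter(all_urls).items():
        if count > 1:
            print(f"DUPLICATE ({count}x): {url}")

if __name__ == "__main__":
    find_duplicates()
```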
The other part crawls the entire HeroArts site, starting from the home page, and flags any link that returns an error.
It works by running a recursive algorithm with the following steps (a code sketch follows the list):
1. Pull all HTTP hyperlinks from the current page by scanning through the HTML source code.
2. For each link, check whether it is accessible and, if so, get its HTTP response code.
   a) If the link is accessible AND returns an OK response code, start a new thread* that repeats from step (1) with the new link.
   b) If not, flag the link as broken, printing the page on which it is found and a processed version of its HTML element.
*This algorithm uses multithreading, searching through multiple pages at the same time, to speed up the process.
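A minimal sketch of that crawl loop is below. It assumes Python with requests, the standard-library HTMLParser, and a ThreadPoolExecutor for the worker threads; the start URL, the worker count, and the choice to recurse only into pages on the HeroArts domain are assumptions for illustration rather than details taken from the original code:

```python
import threading
from concurrent.futures import ThreadPoolExecutor, wait
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

import requests

START_URL = "https://heroarts.com/"  # assumed home page; adjust if it differs

visited = set()   # every URL that has been scheduled for a check
futures = []      # every crawl task handed to the thread pool
lock = threading.Lock()

class LinkParser(HTMLParser):
    """Collects the href of every <a> tag in a page's HTML source (step 1)."""
    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        href = dict(attrs).get("href")
        if tag == "a" and href:
            self.hrefs.append(href)

def crawl(executor, url, found_on):
    # Step 2: check that the link is accessible and returns an OK status.
    try:
        resp = requests.get(url, timeout=30)
        resp.raise_for_status()
    except requests.RequestException as exc:
        print(f"BROKEN: {url} (found on {found_on}): {exc}")
        return

    # Assumption: only pages on the HeroArts domain are crawled further;
    # off-site links are status-checked above but not followed.
    if urlparse(url).netloc != urlparse(START_URL).netloc:
        return

    # Back to step 1 for the new page: pull its hyperlinks and hand each
    # unvisited one to a worker thread.
    parser = LinkParser()
    parser.feed(resp.text)
    for href in parser.hrefs:
        link = urljoin(url, href)
        if urlparse(link).scheme not in ("http", "https"):
            continue
        with lock:
            if link in visited:
                continue
            visited.add(link)
            futures.append(executor.submit(crawl, executor, link, url))

if __name__ == "__main__":
    executor = ThreadPoolExecutor(max_workers=8)
    with lock:
        visited.add(START_URL)
        futures.append(executor.submit(crawl, executor, START_URL, "(start)"))
    # Worker threads keep adding futures while they run, so wait in a loop
    # until no crawl task is left pending before shutting the pool down.
    while True:
        with lock:
            pending = [f for f in futures if not f.done()]
        if not pending:
            break
        wait(pending)
    executor.shutdown()
```

The shared futures list plus the final wait loop is what lets worker threads keep spawning new crawl tasks without the pool shutting down underneath them.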
Before running any part of this program again, double-check the code against the HeroArts site to make sure nothing significant has changed; for example, make sure the sitemap has not grown beyond the number of pages the code expects. If this tool will be used in the future, it may be worth removing some of this hard-coding, for example by having the program look up the number of sitemap pages each time before it runs, as sketched below.
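A run-time check of the sitemap page count (again assuming a Shopify-style index at https://heroarts.com/sitemap.xml) might look like this:

```python
import requests
import xml.etree.ElementTree as ET

SM = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

def sitemap_page_count(index_url="https://heroarts.com/sitemap.xml"):
    """Look up how many child sitemaps the index lists right now,
    instead of trusting a hard-coded count."""
    root = ET.fromstring(requests.get(index_url, timeout=30).content)
    return sum(1 for _ in root.iter(f"{SM}sitemap"))

print("Sitemap pages currently listed:", sitemap_page_count())
```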