URLs could be linked from multiple locations #8
Suppose we have two valid URLs A & B. Both of them contain a link to a third URL Z, which is a 404.
In the report, we don't find out that B had a broken link to Z.
Is that correct? If so, perhaps a
Oh, I see how this could be useful, so that someone could see each instance of the broken link and go to all those pages to fix it.
The proposed implementation would work as well, but I'm wondering how that might be displayed to the user. Maybe something that represents "link was referenced by these pages" instead of "parent".
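To make the "link was referenced by these pages" idea concrete, here is a minimal sketch that groups broken links by target. The `(source, target, status)` tuple shape and the `group_broken_links` name are hypothetical, not taken from the project's code; they just stand in for whatever the crawler records per link.

```python
from collections import defaultdict


def group_broken_links(checked_links):
    """Map each broken target URL to the pages that link to it.

    checked_links: iterable of (source_url, target_url, status_code) tuples.
    This shape is an assumption made for illustration only.
    """
    referenced_by = defaultdict(set)
    for source, target, status in checked_links:
        if status >= 400:  # treat any 4xx/5xx response as broken
            referenced_by[target].add(source)
    # Sort the referrers so the report is stable between runs.
    return {target: sorted(sources) for target, sources in referenced_by.items()}


checked = [
    ("https://example.com/a", "https://example.com/z", 404),
    ("https://example.com/b", "https://example.com/z", 404),
    ("https://example.com/a", "https://example.com/b", 200),
]
report = group_broken_links(checked)
for target, sources in report.items():
    print(f"{target} was referenced by: {', '.join(sources)}")
```

With this grouping, Z shows up once in the report and both A and B are listed under it.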
The other way to look at this is as a one-directional tree. We could remove the
I think this model more closely represents a web page, since each web page has zero or more links to other pages, but there's no built-in concept of which pages link to it.
I guess the question we should ask is which representation is more useful when displaying the results?
Given that we may ship this program as a browser plugin or as a web service, where a user can ask to scan a single page or a whole site, I'm leaning towards the latter: representing each webpage as a collection of links.
If we do this, we could potentially present the whole website as a graph, with nodes as WebPage and edges as links. I think it would be neat to look at, at the very least.
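As a rough sketch of the one-directional model, each page could hold only its outgoing links, and the "who links here" view could be derived by inverting the edges when building the report. The `WebPage` fields (`url`, `status`, `links`) and the `inbound_links` helper are assumptions for illustration; the real class may look different.

```python
from dataclasses import dataclass, field


@dataclass
class WebPage:
    # Hypothetical fields; only outgoing links are stored on the page.
    url: str
    status: int
    links: list = field(default_factory=list)


def inbound_links(pages):
    """Invert the outgoing edges: target URL -> sorted list of linking pages."""
    inbound = {}
    for page in pages:
        for target in page.links:
            inbound.setdefault(target, set()).add(page.url)
    return {url: sorted(sources) for url, sources in inbound.items()}


site = [
    WebPage("A", 200, ["Z"]),
    WebPage("B", 200, ["Z"]),
    WebPage("Z", 404),
]
referrers = inbound_links(site)
broken = sorted(page.url for page in site if page.status == 404)
broken_report = {url: referrers.get(url, []) for url in broken}
print(broken_report)
```

The nodes stay simple this way, and the reverse lookup is only computed when the results are displayed.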