Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Broken-Link-Checker should divide in errors/warnings #2077

Closed
Tracked by #1704
dkehne opened this issue Feb 18, 2023 · 7 comments · Fixed by #2196
Closed
Tracked by #1704

Broken-Link-Checker should divide in errors/warnings #2077

dkehne opened this issue Feb 18, 2023 · 7 comments · Fixed by #2196
Labels
💡 feature New feature or request ⁉️ prio: low Not urgent, can be resolved in the distant future. ☺️ effort: low Should be doable in <4h
Milestone

Comments

@dkehne
Copy link

dkehne commented Feb 18, 2023

Motivation

To keep it simple for municipalities the results should be divided not only in valid and invalid, but in valid, error and warning.
"Ungültig" should be renamed in "Fehlerhaft".

Proposed Solution

404 and "Ungültige URL" should be visible in error-section
all other error(-code)s should be visible in warning-section

For reference: This topic was debated earlier in #1695

@dkehne dkehne added the 💡 feature New feature or request label Feb 18, 2023
@timobrembeck timobrembeck added this to the Backlog milestone Feb 18, 2023
@timobrembeck timobrembeck added ⁉️ prio: low Not urgent, can be resolved in the distant future. ☺️ effort: low Should be doable in <4h labels Feb 18, 2023
@timobrembeck
Copy link
Member

Wouldn't it make more sense to be more honest with the division and name the categories "Not found / Nicht gefunden" and "Invalid / Ungültig"?

To me, "Warning" sounds a bit like "well, not ideal, but doesn't need my direct attention", which is not at all what this category is about. These errors are very real and the links are just not working in most cases. We can still try to minimize the false positives by contributing to the link checker library:

@osmers
Copy link

osmers commented Mar 6, 2023

How far do you think we can limit false positives? Bcs from the ones I checked, I'd say about 50% worked just fine. The other 50% were faulty. I know that some of the ones that worked are technically not 100% correct, but it's not something the Kommunen have the time to address.

@timobrembeck
Copy link
Member

Hard to estimate without trying it, I already implemented a few improvements in the link check library, which should resolve most of the false positive 403 errors... as soon as they release a new version, we can check all links again and test how much it already improved the situation.

@osmers
Copy link

osmers commented Mar 6, 2023

Alright, then let's take that step first and then have a look.
What would be even more interesting are the less understandable codes like: Connection Error: HTTPSConnectionPool(host='www.kl- or New Connection Error: Failed to establish a new connection: [Errno -2] Name or service not known (both examples from Kaiserslautern, in case you wanna have a look) bcs both pages work just fine when I click on the link
Those would be pages we'd definitely want to put under warning since they almost always work...

@timobrembeck
Copy link
Member

Hmm, those are quite unique problems:

curl -I https://www.kl-inklusiv.de/
HTTP/1.1 404 Not Found
Date: Mon, 06 Mar 2023 11:57:56 GMT
Server: waitress
Content-Length: 26
Content-Type: application/json
Via: waitress
X-Powered-By: Zope (www.zope.org), Python (www.python.org)

@osmers
Copy link

osmers commented Mar 7, 2023

Thanks for checking those out - I think it is especially these kinds of problems that Kommunen will come to us about bcs to them there is no apparent error with the page, which is why I would put those under warning (possibly with the explanation of what the warning category means)

@timobrembeck
Copy link
Member

Well, I'm still not convinced we really need a new category for these - we already have a category for false positive errors: the "ignore" category.

There will always be certain edge cases where the checker does not behave exactly like a user's browser. There is no need for region managers to contact the service team in such cases - they should just ignore a link after they manually checked it. Maybe we should put this information in the wiki? Or directly on the link checker page in the CMS?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
💡 feature New feature or request ⁉️ prio: low Not urgent, can be resolved in the distant future. ☺️ effort: low Should be doable in <4h
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants