Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Article Names from Top 5000 should be validated #276

Open
Lichtschalter-5000 opened this issue Mar 20, 2021 · 0 comments
Open

Article Names from Top 5000 should be validated #276

Lichtschalter-5000 opened this issue Mar 20, 2021 · 0 comments
Labels

Comments

@Lichtschalter-5000
Copy link
Contributor

Lichtschalter-5000 commented Mar 20, 2021

image
Snapshot of the Top 5000 in the Wayback machine
(It's on rank 2454)

Because the article name was too long, it has been shortened. The result is no longer the name of an article.
The list of page names is built by retrieving the text content of the links (WikiUpdater.js:92). Instead, the content of the href-attribute could be stored (although this raises the need of more refactoring, because the previous "Albert Einstein" would now be "Albert_Einstein").
Another option would be to check whether an article really exists.

Maybe this is connected to #159, unlikely though, because the backlinks there wouldn't have worked either.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants