fix: retry on href not host #357

Merged: 1 commit, Nov 12, 2021

@bcoe (Contributor) commented Nov 12, 2021

Managed to reproduce the crawl issue [0] locally using the following test repository:

https://github.com/bcoe/external-link-repo

Designed specifically to trigger the bug.


tl;dr: there was a slight bug in the retry logic. Rather than tracking each failed link for retry, it cached only one entry per host. This meant that when a burst of errors occurred, we would immediately hit the stop condition and never actually retry the failed URLs.

I have not been able to recreate the failure when running locally with this fix.
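
To illustrate the difference between keying retries on host and keying them on href, here is a minimal sketch. It is not the code from this patch: `collectRetries`, `byHost`, `byHref`, and the example URLs are all hypothetical names chosen for illustration, and the real retry logic may be structured quite differently.

```typescript
// Hypothetical sketch: none of these names come from the actual patch.
type KeyFn = (url: URL) => string;

// Collect failed links that should be retried, keyed by `keyFn`.
// A Map holds only one entry per key, so the choice of key decides
// how many failed links survive to be retried.
function collectRetries(failed: string[], keyFn: KeyFn): Map<string, string> {
  const pending = new Map<string, string>();
  for (const link of failed) {
    const key = keyFn(new URL(link));
    if (!pending.has(key)) {
      pending.set(key, link);
    }
  }
  return pending;
}

// Keying on host collapses every link on the same host into one entry.
const byHost: KeyFn = (url) => url.host;
// Keying on href keeps one entry per distinct link.
const byHref: KeyFn = (url) => url.href;

// A burst of failures against a single host (example URLs are made up).
const burst = [
  "https://example.com/a",
  "https://example.com/b",
  "https://example.com/c",
];

const retriesByHost = collectRetries(burst, byHost);
const retriesByHref = collectRetries(burst, byHref);

console.log(retriesByHost.size); // 1
console.log(retriesByHref.size); // 3
```

With the host key, only the first failed link in the burst is remembered, so the other two are never retried. With the href key, all three remain in the pending set.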

@JustinBeckwith JustinBeckwith merged commit 54a1994 into JustinBeckwith:main Nov 12, 2021
@github-actions

🎉 This PR is included in version 2.16.1 🎉

The release is available on:

Your semantic-release bot 📦🚀
