Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: exceeding maxSessionRotations calls failedRequestHandler #2029

Merged
merged 2 commits into from Aug 9, 2023

Conversation

barjin
Copy link
Contributor

@barjin barjin commented Aug 9, 2023

Any request exceeding the maxSessionRotations limit currently kills the crawler. This was intended for early exit on too many hard proxy errors, but proved to be somewhat confusing for users using retryOnBlocked (any page that the crawler cannot access due to bot protection kills the run).

With this PR, the requests that cross the limit of maxSessionRotations now get processed with failedRequestHandler. Not sure if this is breaking (definitely might confuse a dev or two), but it's in line with how the crawlers worked before the SessionError update (so... it's actually reverting the hidden breaking change?)

Closes #2028

@B4nan B4nan merged commit b1cb108 into master Aug 9, 2023
7 checks passed
@B4nan B4nan deleted the feat/failed-request-handler-on-session-error branch August 9, 2023 15:16
@abhisheksurve45
Copy link

@B4nan Can we release it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Crawling terminated due to session error on using retryOnBlocked
3 participants