Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Robots.txt] SimpleRobotRulesParser main() to follow five redirects #428

Conversation

sebastian-nagel
Copy link
Contributor

@sebastian-nagel sebastian-nagel commented Jun 16, 2023

Follow "five consecutive redirects" when fetching robots.txt over HTTP as required by RFC 9309.

when fetching robots.txt over HTTP as required by RFC 9309
@sebastian-nagel sebastian-nagel force-pushed the robots-main-to-follow-five-redirects branch from b4b6216 to 9412dff Compare June 16, 2023 15:21
@sebastian-nagel sebastian-nagel marked this pull request as ready for review June 16, 2023 15:22
Copy link
Contributor

@jnioche jnioche left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jnioche jnioche merged commit d685baf into crawler-commons:master Jul 11, 2023
3 of 4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants