Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Respect Robots.txt Files #21

Closed
mgifford opened this issue Apr 5, 2021 · 1 comment
Closed

Respect Robots.txt Files #21

mgifford opened this issue Apr 5, 2021 · 1 comment

Comments

@mgifford
Copy link

mgifford commented Apr 5, 2021

Sites should respect the robots.txt files that some sites use to manage traffic.

Would be great if by default the scanner respected the wishes of the site owner.

@younglim
Copy link
Collaborator

We have developed a feature to follow robots.txt with -r flag when running node cli

  -r, --followRobots                 Option for crawler to adhere to robots.txt
                                     rules if it exists
                                 [string] [choices: "yes", "no"] [default: "no"]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants