-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parameter to ignore robots.txt #22
Comments
Thanks @rodolphopivetta! The parameter you're asking for would be useful, but also extremely risky, imho. There are bad people out there who would love to have an extremely fast, easy to use web spider that ignores netiquette. For link-checking on your own sites (especially in staging), you can append your
Hope this helps. I'm closing the issue but feel free to reopen if I'm completely off the mar. |
Thank you @filiph, this will work for me, but just to keep note, the correct robots.txt that I've used was:
|
Thanks! I updated my answer above in order to not confuse folks. |
Congratulations by the project, this is awesome.
I'm having a specific use case with this project: I have a staging web project that I don't want it to be indexed by the google. In other words, my robots.txt have one rule to ignore all links, and in this case, I can't use the linkcheck to check that.
Would be great have a parameter that says I don't want to honors the robots.txt.
The text was updated successfully, but these errors were encountered: