Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parameter to ignore robots.txt #22

Closed
rodolphopivetta opened this issue Sep 12, 2018 · 3 comments
Closed

Parameter to ignore robots.txt #22

rodolphopivetta opened this issue Sep 12, 2018 · 3 comments

Comments

@rodolphopivetta
Copy link
Contributor

Congratulations by the project, this is awesome.

I'm having a specific use case with this project: I have a staging web project that I don't want it to be indexed by the google. In other words, my robots.txt have one rule to ignore all links, and in this case, I can't use the linkcheck to check that.

Would be great have a parameter that says I don't want to honors the robots.txt.

@filiph
Copy link
Owner

filiph commented Sep 19, 2018

Thanks @rodolphopivetta! The parameter you're asking for would be useful, but also extremely risky, imho. There are bad people out there who would love to have an extremely fast, easy to use web spider that ignores netiquette.

For link-checking on your own sites (especially in staging), you can append your robots.txt to allow linkcheck.

User-agent: linkcheck
Disallow:

Hope this helps. I'm closing the issue but feel free to reopen if I'm completely off the mar.

@filiph filiph closed this as completed Sep 19, 2018
@rodolphopivetta
Copy link
Contributor Author

Thank you @filiph, this will work for me, but just to keep note, the correct robots.txt that I've used was:

User-agent: linkcheck
Disallow:

User-agent: *
Disallow: /

@filiph
Copy link
Owner

filiph commented Sep 19, 2018

Thanks! I updated my answer above in order to not confuse folks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants