robots.txt is not valid - Lighthouse was unable to download a robots.txt file #12936
Comments
Hi @owaishassan72, I've just run Lighthouse via PSI on my end using v8.0, and it confirms that your robots.txt is valid (please see the screenshot). From this, I assume it may be an issue with the version of Lighthouse you're using. Maybe try updating to v8.0 and see whether it recognises the file on your end.
I'm seeing this on web.dev/measure as well: https://lighthouse-dot-webdotdevsite.appspot.com//lh/html?url=https%3A%2F%2Fwww.aleksandrhovhannisyan.com%2F
Hi Aleksandr, I've run your site through the DevTools Lighthouse (v7.5) and it detects your robots.txt fine. I've also run it through the PSI Extension (v8.0) and I'm getting the same message you are. In your case it seems to be the other way round; I'm not sure why PSI isn't detecting your robots.txt, as the syntax looks correct to me. I've also checked it with Ryte's scanner and it crawls it fine. You can also check it with Google's robots.txt scanner and other site scanners to be sure. Hopefully Google can diagnose what the issue is, @connorjclark. Hope this helps.
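For anyone who wants a quick local sanity check of the syntax before reaching for an external scanner, here is a rough sketch. It is not what Lighthouse, PSI, or Google's tester actually run; the `lint_robots` helper and the `KNOWN_DIRECTIVES` set are made up for illustration, and the directive list is an assumption that real crawlers may extend.

```python
from typing import List, Tuple

# Assumption: a small set of commonly seen directives, not an official list.
KNOWN_DIRECTIVES = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay", "host"}


def lint_robots(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, problem) pairs for lines that look malformed."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        name, separator, _value = line.partition(":")
        if not separator:
            problems.append((number, "missing ':' separator"))
        elif name.strip().lower() not in KNOWN_DIRECTIVES:
            problems.append((number, f"unknown directive {name.strip()!r}"))
    return problems
```

An empty list only means the file passed this very basic check. It says nothing about whether Lighthouse was able to download the file in the first place, which is the failure reported in this issue.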
Yes, this looks like a new PSI issue. From HTTPArchive/httparchive.org#422, https://httparchive.org/ is also failing. Confirmed with a direct API request. PSI updated to Chrome 92 yesterday, so this is hitting a new path (for PSI) in the gatherer at lighthouse/lighthouse-core/gather/gatherers/seo/robots-txt.js, lines 41 to 56 in 244364c.
I started seeing this error yesterday. It has now disappeared on one site, https://geek.pontik.com, but still appears on the subdomain https://radio.pontik.com and on the main site https://pontik.com. These are on separate servers.
This should be fixed for everyone now. Keeping open to track a permanent fix.
I am still seeing this issue with all WordPress Multisite installs, even though there is a robots.txt file that can be viewed in the browser and Google's robots testing tool reports no issues. Example content of robots.txt for https://bloxhams.co.nz/ and https://bloxhams.co.nz/robots.txt
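Since the file loads fine in a browser, it can help to separate "the download itself failed" from "the server answered with an error" when reproducing this outside Lighthouse. Below is a minimal sketch using only the Python standard library. The category names and the `robots-check-sketch` User-Agent are invented for illustration, and the mapping from "no HTTP status at all" to Lighthouse's "unable to download" message is an assumption based on the wording of the error, not a copy of the audit's logic.

```python
from typing import Optional, Tuple
from urllib.error import HTTPError, URLError
from urllib.parse import urljoin
from urllib.request import Request, urlopen


def robots_url(page_url: str) -> str:
    """Return the robots.txt URL for the origin that serves page_url."""
    return urljoin(page_url, "/robots.txt")


def classify(status: Optional[int], content: Optional[str]) -> str:
    """Label a fetch result. These labels are illustrative, not Lighthouse's."""
    if status is None:
        # Assumption: no status at all corresponds to "unable to download".
        return "download-failed"
    if status >= 500:
        return "server-error"
    if status >= 400 or not (content or "").strip():
        return "missing-or-empty"
    return "fetched"


def fetch_robots(page_url: str, timeout: float = 10.0) -> Tuple[Optional[int], Optional[str]]:
    """Fetch robots.txt and return (status, body); status is None on network failure."""
    request = Request(robots_url(page_url), headers={"User-Agent": "robots-check-sketch"})
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", errors="replace")
    except HTTPError as err:
        # The server replied, but with a 4xx/5xx status.
        return err.code, None
    except (URLError, TimeoutError, OSError):
        # DNS failure, refused connection, TLS error, timeout, and so on.
        return None, None
```

Calling `classify(*fetch_robots("https://bloxhams.co.nz/"))` a few times in a row would show whether the result is stable or only fails intermittently. A plain HTTP client is not the same as a fetch made from inside Chrome, so a clean result here does not rule out a problem on the Lighthouse side.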
I got the same using latest (master) LH, but only for the first attempt. Subsequent runs fetched the robots.txt successfully. If you can consistently repro with the LH node CLI, please run with |
FAQ
URL
zomusk.com
What happened?
I was running Lighthouse on my website and saw the error that "robots.txt is invalid" even though it is NOT.
I have tested my robots.txt file, and it is fine and meets Google's expectations.
What did you expect?
I expected to not face this issue as there is no issue with my robots.txt file.
What have you tried?
No response
How were you running Lighthouse?
Chrome DevTools
Lighthouse Version
7.5.0
Chrome Version
No response
Node Version
No response
Relevant log output
No response