Bingbot cannot index (or scrape) WPcom Simple sites #83341
Comments
Initial interactions: 7163433-zd-a8c Update to the P2 here: peGwbA-Oi-p2#comment-1679 Edit: looks like the script didn't catch the link above. Opening the comment to edit changes the link automagically |
Adding a quick update: this link to a Sep 2023 announcement covers content usage controls. |
7174793-zd-a8c |
7162106-zd-a8c |
7151777-zen |
7220012-zen |
7191656-zd-a8c |
7229871-zen |
7205505-zd-a8c |
7293184-zen |
7329176-zen |
7334644-zen |
A report of this here: https://wordpress.com/forums/topic/blocking-bingbot-in-robots-txt-why/?view=all#post-4004393 |
Another report; 81441-odie |
Another report- 7344690-zen |
Another report, self-reporting |
7360087-zen |
Another report |
7232492-zen |
7411333-zen |
7412303-zen |
7414145-zen |
7471023-zd-a8c |
7472791-zd-a8c |
7453782-zen |
Another report: 7497842-zen |
7493070-zen |
070306-Zen |
Another report: #7560761-zen |
7581254-zen |
7581664-zen |
7669127-Zen |
7658463-zen |
Closing this issue as the Bingbot is no longer blocked on WordPress.com sites. |
Predef
Please see this internal P2 for details on our predef: p7DVsv-j5F-p2#comment-48067.
This issue will be updated with additional context as needed.
Quick summary
We have determined that Microsoft is using its generic Bingbot crawler to scrape sites. They have not yet documented a way to block the scraping behavior, so for the moment, we have blocked Bingbot from indexing Simple sites via robots.txt directives.

We've created this issue to track support interactions related to this.
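When triaging a report, it can help to confirm whether a site's robots.txt actually disallows Bingbot. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `SAMPLE_ROBOTS_TXT` content and the `is_blocked` helper are hypothetical and for illustration only; the exact directives served on Simple sites are not shown in this issue.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (an assumption for illustration);
# the real directives served on Simple sites are not quoted in this issue.
SAMPLE_ROBOTS_TXT = """\
User-agent: Bingbot
Disallow: /

User-agent: *
Disallow: /wp-admin/
"""


def is_blocked(robots_txt: str, user_agent: str, url: str = "/") -> bool:
    """Return True if robots_txt disallows user_agent from fetching url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


bingbot_blocked = is_blocked(SAMPLE_ROBOTS_TXT, "Bingbot")
other_blocked = is_blocked(SAMPLE_ROBOTS_TXT, "Googlebot")
```

To check a live site instead, the same parser can load the file with `set_url(...)` followed by `read()`. Note that robots.txt is advisory: it tells a crawler what it may fetch but does not enforce anything.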
Steps to reproduce
N/A
What you expected to happen
N/A
What actually happened
N/A
Impact
Some (< 50%)
Available workarounds?
No but the platform is still usable
Platform (Simple and/or Atomic)
No response
Logs or notes
No response