Skip to content

Coolguyzone/chore/ai rules robots#17796

Merged
coolguyzone merged 3 commits into
masterfrom
coolguyzone/chore/ai-rules-robots
May 19, 2026
Merged

Coolguyzone/chore/ai rules robots#17796
coolguyzone merged 3 commits into
masterfrom
coolguyzone/chore/ai-rules-robots

Conversation

@coolguyzone
Copy link
Copy Markdown
Contributor

@coolguyzone coolguyzone commented May 18, 2026

DESCRIBE YOUR PR

This PR addresses some of the issue found here: https://www.mintlify.com/score/sentry

These updates to robots.txt add AI bot rules and content signals to help agents better navigate our docs.

More on content signals: https://contentsignals.org/

Some details of this change:

  • All bots get Allow: / — appropriate for public documentation
  • The wildcard User-agent: * block now also has an explicit Allow: /
    (previously it had no rule at all, which is technically ambiguous)
  • If we ever wants to block a specific crawler, it's a one-line change to Disallow: /.
  • isDeveloperDocs is now evaluated at module load time rather than inside the
    handler, which is fine since it's a build-time env var

IS YOUR CHANGE URGENT?

Help us prioritize incoming PRs by letting us know when the change needs to go live.

  • Urgent deadline (GA date, etc.):
  • Other deadline:
  • None: Not urgent, can wait up to 1 week+

SLA

  • Teamwork makes the dream work, so please add a reviewer to your PRs.
  • Please give the docs team up to 1 week to review your PR unless you've added an urgent due date to it.
    Thanks in advance for your help!

PRE-MERGE CHECKLIST

Make sure you've checked the following before merging your changes:

  • Checked Vercel preview for correctness, including links
  • PR was reviewed and approved by any necessary SMEs (subject matter experts)
  • PR was reviewed and approved by a member of the Sentry docs team

LEGAL BOILERPLATE

Look, I get it. The entity doing business as "Sentry" was incorporated in the State of Delaware in 2015 as Functional Software, Inc. and is gonna need some rights from me in order to utilize my contributions in this here PR. So here's the deal: I retain all rights, title and interest in and to my contributions, and by keeping this boilerplate intact I confirm that Sentry can use, modify, copy, and redistribute my contributions, under Sentry's choice of terms.

EXTRA RESOURCES

@vercel
Copy link
Copy Markdown

vercel Bot commented May 18, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
develop-docs Ready Ready Preview, Comment May 19, 2026 8:31pm
sentry-docs Ready Ready Preview, Comment May 19, 2026 8:31pm

Request Review

Copy link
Copy Markdown
Member

@sergical sergical left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

noice

Comment thread app/robots.txt/route.ts Outdated
Allow: /
Content-Signal: ai-train=yes, search=yes, ai-input=yes

User-agent: Claude-Web
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
User-agent: Claude-Web
User-agent: ClaudeBot

Comment thread app/robots.txt/route.ts Outdated
Comment on lines +40 to +46
User-agent: Bytespider
Allow: /
Content-Signal: ai-train=yes, search=yes, ai-input=yes

User-agent: CCBot
Allow: /
Content-Signal: ai-train=yes, search=yes, ai-input=yes
Copy link
Copy Markdown
Contributor

@sfanahata sfanahata May 19, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to explicitly allow trainers to train on our docs? I kind of feel like no.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i had the same feeling but then i thought if they don't train on our current docs, then LLMs will keep showing people old ways of using Sentry which might not be ideal 😅

Copy link
Copy Markdown
Contributor

@sfanahata sfanahata May 19, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, good point. I guess train is a wide range of things. It's too early in content-signal land to be explicit. Sounds like content-signal is all experimental and not adopted anyway, so not sure it'll actually prevent any bot from taking an action.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I had this discussion with Matt, it feels like are direction so far has been to optimize the docs for LLM usage, so training seemed like the right call. I'm open to hearing any objections though.

Comment thread app/robots.txt/route.ts
Sitemap: ${isDeveloperDocs ? 'https://develop.sentry.dev/sitemap.xml' : 'https://docs.sentry.io/sitemap.xml'}
Sitemap: ${sitemap}

User-agent: *
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@coolguyzone since we're wildcard allowing every bot, I'm not sure it makes sense to add in more specific bots that we're also carte blanche allowing? Beside the trainers, which we might want to update in what they're allowed to do, I wonder if we remove the explicits of each bot? Otherwise, we're still missing some, like

  • ChatGPT-User
  • PerplexityBot
  • Perplexity-User
  • Meta-ExternalAgent
  • cohere-ai
  • Diffbot

And that list will continue to get longer/need updating as more bots are created.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, in the future it might make sense if we are adding different rules for different bots but you're right, for now it makes sense just to collapse everything to a wildcard.

@coolguyzone coolguyzone merged commit 09456ba into master May 19, 2026
23 checks passed
@coolguyzone coolguyzone deleted the coolguyzone/chore/ai-rules-robots branch May 19, 2026 21:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants