Skip to content

docs: add robots.txt with AI crawler Allow rules#43419

Draft
pelikhan with Copilot wants to merge 2 commits into
mainfrom
copilot/geo-optimizer-add-robots-txt-again
Draft

docs: add robots.txt with AI crawler Allow rules#43419
pelikhan with Copilot wants to merge 2 commits into
mainfrom
copilot/geo-optimizer-add-robots-txt-again

Conversation

Copilot AI commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

The docs site (https://github.github.com/gh-aw/) had no robots.txt, causing AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) to default to restricted access — scoring 0/18 on the GEO robots category across all 195 sitemap pages.

Changes

  • docs/public/robots.txt — adds permissive robots file served at /gh-aw/robots.txt:
    • User-agent: * wildcard allows all crawlers
    • Explicit Allow: / for 22 named AI and search bots for unambiguous signaling
    • Sitemap: directive pointing to https://github.github.com/gh-aw/sitemap.xml
User-agent: *
Allow: /

User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Google-Extended
Allow: /

Sitemap: https://github.github.com/gh-aw/sitemap.xml

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>
Copilot AI changed the title [WIP] Add robots.txt to docs site for AI crawler indexing docs: add robots.txt with AI crawler Allow rules Jul 4, 2026
Copilot AI requested a review from pelikhan July 4, 2026 17:02
@github-actions

This comment has been minimized.

@github-actions

github-actions Bot commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

🤖 PR Triage — Run §28715668077

Field Value
Category docs
Risk 🟢 Low
Score 30/100
Impact 14/50
Urgency 10/30
Quality 6/20
Action defer

Adds robots.txt for AI crawler Allow rules. +2 doc files only. Low urgency SEO/docs change.

Generated by 🔧 PR Triage Agent · 113.5 AIC · ⌖ 13 AIC · ⊞ 5.5K ·

@github-actions

This comment has been minimized.

@github-actions

This comment has been minimized.

@github-actions

github-actions Bot commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Thanks for the automated contribution! There's one significant issue that needs resolving before this can be reviewed:

  • Description doesn't match the diff — The PR title and body describe adding a docs/public/robots.txt with AI crawler Allow rules, but the actual changed files are pkg/actionpins/data/action_pins.json and pkg/workflow/data/action_pins.json. No robots.txt file appears in the diff. The pr-type:docs label is also inaccurate given the actual changes. This mismatch makes it impossible to understand the intent of the PR — please update the title, body, and labels to accurately describe the action pin data updates.

If you'd like a hand, you can assign this prompt to your coding agent:

The PR #43419 in github/gh-aw has a description that doesn't match its diff.

The title and body describe adding docs/public/robots.txt with AI crawler Allow rules,
but the actual changed files are:
- pkg/actionpins/data/action_pins.json (+756/-756)
- pkg/workflow/data/action_pins.json (+756/-756)

Please:
1. Update the PR title to accurately reflect the action_pins.json changes (e.g. "chore: update action pins data").
2. Rewrite the PR body to describe what actually changed in the two action_pins.json files — what pins were added, removed, or updated.
3. Remove the pr-type:docs label and add an appropriate label (e.g. pr-type:chore or pr-type:data).

Generated by ✅ Contribution Check · 256.1 AIC · ⌖ 23.1 AIC · ⊞ 6.3K ·

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[geo-optimizer] Add robots.txt to docs site to allow AI crawler indexing

2 participants