You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This automated GEO (Generative Engine Optimization) audit evaluated AI-engine citation readiness for the project homepage and documentation site. Both targets are in the Foundation band — accessible to AI crawlers, but missing key signals that enable citations in ChatGPT, Perplexity, and Google AI Overviews.
Extensive AI bot permissions — GitHub's robots.txt explicitly allows 27 AI crawlers including GPTBot, ClaudeBot, PerplexityBot, Google-Extended (15/18). No bots are blocked.
Perfect meta tags on docs site — Title, description, canonical, Open Graph title/description/image all present (14/14).
Rich schema markup on docs site — WebSite, Organization, SoftwareApplication, and FAQPage JSON-LD schemas (12/16). FAQPage depth of 4 boosts Q&A-style citations.
Strong signals on docs site — Language tag, RSS feed (/blog/rss.xml), and freshness date all present (6/6).
Google AI Overviews score 80/100 on docs site — solid meta + schema combination already meets Google's AI requirements.
🚨 Critical Gaps
No robots.txt on docs site (0/18) — github.github.com/gh-aw has no robots.txt. AI crawlers receive no explicit permission signals. This affects all 188 sitemap pages — every page scores 0 on this dimension.
No llms.txt on docs site (0/18) — The docs site has no LLM-friendly structured summary. The README has one; the docs site needs its own at https://github.github.com/gh-aw/llms.txt.
Zero AI discovery endpoints (0/6 on both targets) — Neither site exposes /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, or /ai/service.json.
No JSON-LD schema on README/GitHub page (0/16) — GitHub's repo page injects no structured data (WebSite, Organization, FAQPage). This is partially outside the project's direct control.
Keyword stuffing penalties — github at 2.8% density on docs site; issue at 22.2% on README (driven by GitHub UI boilerplate). Both trigger a -3 negative score penalty.
Sitemap-wide Critical band — 19 of 20 audited pages score Critical (27–33/100). Root cause: missing robots.txt and llms.txt cascade to every page.
🔧 Recommended Fixes
Ordered by estimated impact:
Priority
Fix
Est. Points
Scope
🔴 1
Add robots.txt to github.github.com/gh-aw allowing AI crawlers
+18/page
All 188 pages
🔴 2
Create /llms.txt for docs site: gh aw geo llms --base-url https://github.github.com/gh-aw
Root cause: Every page scores 0 on Robots.txt and llms.txt (site-wide gaps). Adding robots.txt and llms.txt to the docs site would immediately lift all 188 pages by up to 36 points.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
GEO Audit Report — github/gh-aw
Audit Date: 2026-06-11
Run: §27365547349
This automated GEO (Generative Engine Optimization) audit evaluated AI-engine citation readiness for the project homepage and documentation site. Both targets are in the Foundation band — accessible to AI crawlers, but missing key signals that enable citations in ChatGPT, Perplexity, and Google AI Overviews.
📊 Scores
github.github.com/gh-aw/)github.com/github/gh-aw)✅ Top Strengths
llms.txton README — 2,919 words, 12 sections, 117 links (14/18). Excellent LLM-friendly project summary.robots.txtexplicitly allows 27 AI crawlers including GPTBot, ClaudeBot, PerplexityBot, Google-Extended (15/18). No bots are blocked.WebSite,Organization,SoftwareApplication, andFAQPageJSON-LD schemas (12/16). FAQPage depth of 4 boosts Q&A-style citations./blog/rss.xml), and freshness date all present (6/6).🚨 Critical Gaps
No
robots.txton docs site (0/18) —github.github.com/gh-awhas no robots.txt. AI crawlers receive no explicit permission signals. This affects all 188 sitemap pages — every page scores 0 on this dimension.No
llms.txton docs site (0/18) — The docs site has no LLM-friendly structured summary. The README has one; the docs site needs its own athttps://github.github.com/gh-aw/llms.txt.Zero AI discovery endpoints (0/6 on both targets) — Neither site exposes
/.well-known/ai.txt,/ai/summary.json,/ai/faq.json, or/ai/service.json.No JSON-LD schema on README/GitHub page (0/16) — GitHub's repo page injects no structured data (WebSite, Organization, FAQPage). This is partially outside the project's direct control.
Keyword stuffing penalties —
githubat 2.8% density on docs site;issueat 22.2% on README (driven by GitHub UI boilerplate). Both trigger a -3 negative score penalty.Sitemap-wide Critical band — 19 of 20 audited pages score Critical (27–33/100). Root cause: missing
robots.txtandllms.txtcascade to every page.🔧 Recommended Fixes
Ordered by estimated impact:
robots.txttogithub.github.com/gh-awallowing AI crawlers/llms.txtfor docs site:gh aw geo llms --base-url https://github.github.com/gh-aw/.well-known/ai.txt,/ai/summary.json,/ai/faq.json,/ai/service.jsonWebAppJSON-LD type to docs site schema (currently missing)/aboutpage link on docs site for identity trustdateModifiedschema + RSS<link>tag to READMEsameAsKG pillar links to README Organization schema (Wikipedia, Wikidata, LinkedIn, Crunchbase)has_numbersis false; content score 7/12)<link>tag to README page (missing — may cause duplicate indexing)📋 Full Category Breakdown
Docs Site (
github.github.com/gh-aw/) — 42/100 Foundationgithub2.8%), hidden textTrust Stack: C (15/25) — Technical 3/5, Identity 3/5, Social 4/5, Academic 1/5, Consistency 4/5
Platform Citation Scores: ChatGPT 50, Perplexity 65, Google AI Overviews 80
Negative signals: Keyword stuffing (
github2.8%), hidden text (CtrlKwith display:none)README (
github.com/github/gh-aw) — 55/100 Foundationissue22.2%), 12 broken links, popup signalsTrust Stack: C (16/25) — Technical 5/5 (perfect!), Identity 3/5, Social 1/5, Academic 4/5, Consistency 3/5
Platform Citation Scores: ChatGPT 55, Perplexity 70, Google AI Overviews 48
Content decay risk: HIGH (evergreen score 30/100) — 10 temporal/version signals detected
📄 Sitemap Page Scores (20 of 188 audited)
.../.../blog.../blog/3,/blog/4,/blog/8.../blog/2,/blog/5,/blog/6.../blog/continuous-refactoring.../blog/7.../blog/9,/blog/welcome-*,/blog/advanced-analytics,/blog/campaigns.../blog/continuous-simplicity,/blog/continuous-style,/blog/creative-culture,/blog/documentation.../agent-factory-statusRoot cause: Every page scores 0 on Robots.txt and llms.txt (site-wide gaps). Adding
robots.txtandllms.txtto the docs site would immediately lift all 188 pages by up to 36 points.Automated audit powered by geo-optimizer-skill · Run logs
Beta Was this translation helpful? Give feedback.
All reactions