You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
README robots.txt (15/18): All major AI crawlers explicitly allowed — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and 23 others.
README llms.txt (14/18): Present and well-structured with 12 sections, 117 links, and 2,919 words.
README content (12/12 — full score): Rich heading hierarchy (42 headings), 3,006 words, numbers, external links, and front-loaded key info.
Docs site meta tags (14/14 — full score): Complete title, description, canonical URL, and Open Graph tags across the site.
Docs site signals (6/6 — full score): RSS feed present, freshness date (2026-05-09), and lang attribute set.
Docs site JSON-LD schema (12/16): WebSite, Organization, SoftwareApplication, and FAQPage schemas present.
Docs site Google AI score: 80/100 — strong for that platform thanks to complete meta tags, sameAs links, and structured data.
README citability: 84/100 (high) — well-positioned for citation by ChatGPT (55) and Perplexity (70).
🚨 Critical Gaps
Docs site has no robots.txt (0/18): AI crawlers receive no explicit access rules — the most impactful single gap, penalising every page in the sitemap.
Docs site has no llms.txt (0/18): All 20 sampled pages score 0 here; adding this file would lift the sitemap average significantly.
AI Discovery endpoints missing on both targets (0/6 each): Neither site has /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, or /ai/service.json.
README has no JSON-LD schema (0/16): No WebSite, Organization, or FAQPage schema — the largest single gap on the README.
Docs site content is thin (7/12): Only 921 words on the homepage with zero statistics or numerical data, weak for AI snippet selection and content quality signals.
🔧 Recommended Fixes
Ordered by estimated impact:
Create robots.txt for github.github.com/gh-aw/ — Allow GPTBot, ClaudeBot, PerplexityBot, Google-Extended. Single file; unlocks up to +18 pts per page across the entire docs site. Run geo llms --base-url https://github.github.com/gh-aw to scaffold.
Create /llms.txt for the docs site — Machine-readable site summary with H1, description, sections, and links. Adds up to +18 pts per page sitewide. Scaffold with: geo llms --base-url https://github.github.com/gh-aw.
Add JSON-LD schema to the README / GitHub repo page — Inject WebSite, Organization (with sameAs to Wikipedia/Wikidata/LinkedIn/Crunchbase), and FAQPage schemas. Potential +16 pts on README.
Create AI discovery endpoints — /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, /ai/service.json for both the docs site and README. Each is +6 pts AI Discovery.
Add <link rel="canonical"> to README/GitHub page — Prevents duplicate content penalisation in AI indexing (+3 pts meta).
Fix 12 broken links on the README — Currently triggering a medium-severity negative signal and −3 score penalty.
Enrich docs site homepage content — Add concrete statistics and numerical data (e.g., workflow count, performance numbers). The audit notes that adding stats can yield +40% AI visibility; currently numbers_count: 0.
Diversify keyword vocabulary — README: 'issue' at 23.2% density; docs site: 'github' at 2.8%. Both flagged for keyword stuffing.
Add dateModified schema / freshness signals to README — Currently no freshness date detected, triggering content decay warnings for temporal signals.
Add VideoObject schema to docs site — 2 videos detected with captions but no schema; add name, description, thumbnailUrl, uploadDate.
📋 Full Breakdown by Category
README (github.com/github/gh-aw) — 55/100 Foundation
Category
Score
Max
Notes
Robots.txt
15
18
All AI bots allowed; not explicitly listed per-bot
llms.txt
14
18
Present, 12 sections, 117 links, 2919 words; no llms-full.txt
Schema JSON-LD
0
16
None found
Meta Tags
11
14
Missing canonical URL
Content
12
12
Full score
Signals
3
6
No RSS, no freshness date
AI Discovery
0
6
No well-known/ai.txt, no /ai/* endpoints
Brand & Entity
3
10
No Wikipedia/Wikidata/LinkedIn/Crunchbase sameAs links
Full score — canonical, OG, title, description all present
Content
7
12
921 words, no stats/numbers, no front-loading
Signals
6
6
Full score — RSS, freshness, lang
AI Discovery
0
6
No well-known/ai.txt, no /ai/* endpoints
Brand & Entity
6
10
Wikidata + LinkedIn present; missing Wikipedia, Crunchbase, about page
Negative Penalty
−3
—
Hidden text ("CtrlK"), keyword stuffing (github 2.8%)
Trust stack: Grade C / Medium — Consistency 4/5, Social 4/5, Identity 3/5, Technical 3/5, Academic 1/5 Platform scores: Google AI 80, Perplexity 65, ChatGPT 50 WebMCP readiness: Basic — SearchAction potential action detected CDN note: AI crawlers receive HTTP 301 redirect; content accessible after redirect
📄 Sitemap Page Scores (20 of 188 pages audited)
URL
Score
Band
/ (homepage)
42
Foundation
/blog
33
Critical
/blog/3, /4, /5, /8
33
Critical
/blog/2, /6
32
Critical
/blog/2026-01-13-...-continuous-improvement
33
Critical
/blog/2026-01-13-...-continuous-refactoring
32
Critical
Most blog posts
30
Critical
/agent-factory-status
27
Critical
Pattern: All non-homepage pages score 0 on Robots.txt, llms.txt, Schema, and AI Discovery. Meta tags are consistently 14/14. The site-wide average of 31.65 will improve dramatically once robots.txt and llms.txt are added at the root.
Sitemap size: 188 URLs discovered; only 20 audited (cap hit). Use --max-urls to expand coverage.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
GEO Audit Report — github/gh-aw
Audit Date: 2026-06-17
Run: §27707225244
📊 Scores
github.github.com/gh-aw/)github.com/github/gh-aw)✅ Top Strengths
langattribute set.🚨 Critical Gaps
robots.txt(0/18): AI crawlers receive no explicit access rules — the most impactful single gap, penalising every page in the sitemap.llms.txt(0/18): All 20 sampled pages score 0 here; adding this file would lift the sitemap average significantly./.well-known/ai.txt,/ai/summary.json,/ai/faq.json, or/ai/service.json.🔧 Recommended Fixes
Ordered by estimated impact:
robots.txtforgithub.github.com/gh-aw/— Allow GPTBot, ClaudeBot, PerplexityBot, Google-Extended. Single file; unlocks up to +18 pts per page across the entire docs site. Rungeo llms --base-url https://github.github.com/gh-awto scaffold./llms.txtfor the docs site — Machine-readable site summary with H1, description, sections, and links. Adds up to +18 pts per page sitewide. Scaffold with:geo llms --base-url https://github.github.com/gh-aw.sameAsto Wikipedia/Wikidata/LinkedIn/Crunchbase), and FAQPage schemas. Potential +16 pts on README./.well-known/ai.txt,/ai/summary.json,/ai/faq.json,/ai/service.jsonfor both the docs site and README. Each is +6 pts AI Discovery.<link rel="canonical">to README/GitHub page — Prevents duplicate content penalisation in AI indexing (+3 pts meta).numbers_count: 0.dateModifiedschema / freshness signals to README — Currently no freshness date detected, triggering content decay warnings for temporal signals.VideoObjectschema to docs site — 2 videos detected with captions but no schema; addname,description,thumbnailUrl,uploadDate.📋 Full Breakdown by Category
README (
github.com/github/gh-aw) — 55/100 Foundationllms-full.txtTrust stack: Grade C / Medium — Technical 5/5, Academic 4/5, Identity 3/5, Consistency 3/5, Social 1/5
Platform scores: Perplexity 70, ChatGPT 55, Google AI 48
WebMCP readiness: Basic (not agent-ready)
Chunk readiness: 46/100 — avg section 23.8 words, needs longer sections for RAG retrieval
Docs Site (
github.github.com/gh-aw/) — 42/100 FoundationTrust stack: Grade C / Medium — Consistency 4/5, Social 4/5, Identity 3/5, Technical 3/5, Academic 1/5
Platform scores: Google AI 80, Perplexity 65, ChatGPT 50
WebMCP readiness: Basic — SearchAction potential action detected
CDN note: AI crawlers receive HTTP 301 redirect; content accessible after redirect
📄 Sitemap Page Scores (20 of 188 pages audited)
/(homepage)/blog/blog/3,/4,/5,/8/blog/2,/6/blog/2026-01-13-...-continuous-improvement/blog/2026-01-13-...-continuous-refactoring/agent-factory-statusPattern: All non-homepage pages score 0 on Robots.txt, llms.txt, Schema, and AI Discovery. Meta tags are consistently 14/14. The site-wide average of 31.65 will improve dramatically once
robots.txtandllms.txtare added at the root.Sitemap size: 188 URLs discovered; only 20 audited (cap hit). Use
--max-urlsto expand coverage.Automated audit powered by geo-optimizer-skill · Run logs
Beta Was this translation helpful? Give feedback.
All reactions