[geo-optimizer] GEO Audit Report — 2026-06-17 #39857

2026-06-17T17:30:40Z

github-actions[bot]
Bot Jun 17, 2026

GEO Audit Report — github/gh-aw

Audit Date: 2026-06-17
Run: §27707225244

📊 Scores

Target	Score	Band
Docs site (`github.github.com/gh-aw/`)	42/100	Foundation
README (`github.com/github/gh-aw`)	55/100	Foundation
Sitemap average (20 pages sampled)	31.65/100	Critical

✅ Top Strengths

README robots.txt (15/18): All major AI crawlers explicitly allowed — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and 23 others.
README llms.txt (14/18): Present and well-structured with 12 sections, 117 links, and 2,919 words.
README content (12/12 — full score): Rich heading hierarchy (42 headings), 3,006 words, numbers, external links, and front-loaded key info.
Docs site meta tags (14/14 — full score): Complete title, description, canonical URL, and Open Graph tags across the site.
Docs site signals (6/6 — full score): RSS feed present, freshness date (2026-05-09), and lang attribute set.
Docs site JSON-LD schema (12/16): WebSite, Organization, SoftwareApplication, and FAQPage schemas present.
Docs site Google AI score: 80/100 — strong for that platform thanks to complete meta tags, sameAs links, and structured data.
README citability: 84/100 (high) — well-positioned for citation by ChatGPT (55) and Perplexity (70).

🚨 Critical Gaps

Docs site has no robots.txt (0/18): AI crawlers receive no explicit access rules — the most impactful single gap, penalising every page in the sitemap.
Docs site has no llms.txt (0/18): All 20 sampled pages score 0 here; adding this file would lift the sitemap average significantly.
AI Discovery endpoints missing on both targets (0/6 each): Neither site has /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, or /ai/service.json.
README has no JSON-LD schema (0/16): No WebSite, Organization, or FAQPage schema — the largest single gap on the README.
Docs site content is thin (7/12): Only 921 words on the homepage with zero statistics or numerical data, weak for AI snippet selection and content quality signals.

🔧 Recommended Fixes

Ordered by estimated impact:

Create robots.txt for github.github.com/gh-aw/ — Allow GPTBot, ClaudeBot, PerplexityBot, Google-Extended. Single file; unlocks up to +18 pts per page across the entire docs site. Run geo llms --base-url https://github.github.com/gh-aw to scaffold.
Create /llms.txt for the docs site — Machine-readable site summary with H1, description, sections, and links. Adds up to +18 pts per page sitewide. Scaffold with: geo llms --base-url https://github.github.com/gh-aw.
Add JSON-LD schema to the README / GitHub repo page — Inject WebSite, Organization (with sameAs to Wikipedia/Wikidata/LinkedIn/Crunchbase), and FAQPage schemas. Potential +16 pts on README.
Create AI discovery endpoints — /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, /ai/service.json for both the docs site and README. Each is +6 pts AI Discovery.
Add <link rel="canonical"> to README/GitHub page — Prevents duplicate content penalisation in AI indexing (+3 pts meta).
Fix 12 broken links on the README — Currently triggering a medium-severity negative signal and −3 score penalty.
Enrich docs site homepage content — Add concrete statistics and numerical data (e.g., workflow count, performance numbers). The audit notes that adding stats can yield +40% AI visibility; currently numbers_count: 0.
Diversify keyword vocabulary — README: 'issue' at 23.2% density; docs site: 'github' at 2.8%. Both flagged for keyword stuffing.
Add dateModified schema / freshness signals to README — Currently no freshness date detected, triggering content decay warnings for temporal signals.
Add VideoObject schema to docs site — 2 videos detected with captions but no schema; add name, description, thumbnailUrl, uploadDate.

📋 Full Breakdown by Category

README (`github.com/github/gh-aw`) — 55/100 Foundation

Category	Score	Max	Notes
Robots.txt	15	18	All AI bots allowed; not explicitly listed per-bot
llms.txt	14	18	Present, 12 sections, 117 links, 2919 words; no `llms-full.txt`
Schema JSON-LD	0	16	None found
Meta Tags	11	14	Missing canonical URL
Content	12	12	Full score
Signals	3	6	No RSS, no freshness date
AI Discovery	0	6	No well-known/ai.txt, no /ai/* endpoints
Brand & Entity	3	10	No Wikipedia/Wikidata/LinkedIn/Crunchbase sameAs links
Negative Penalty	−3	—	Hidden text, keyword stuffing (issue 23.2%), 12 broken links

Trust stack: Grade C / Medium — Technical 5/5, Academic 4/5, Identity 3/5, Consistency 3/5, Social 1/5
Platform scores: Perplexity 70, ChatGPT 55, Google AI 48
WebMCP readiness: Basic (not agent-ready)
Chunk readiness: 46/100 — avg section 23.8 words, needs longer sections for RAG retrieval

Docs Site (`github.github.com/gh-aw/`) — 42/100 Foundation

Category	Score	Max	Notes
Robots.txt	0	18	robots.txt not found
llms.txt	0	18	llms.txt not found
Schema JSON-LD	12	16	WebSite, Organization, SoftwareApplication, FAQPage present
Meta Tags	14	14	Full score — canonical, OG, title, description all present
Content	7	12	921 words, no stats/numbers, no front-loading
Signals	6	6	Full score — RSS, freshness, lang
AI Discovery	0	6	No well-known/ai.txt, no /ai/* endpoints
Brand & Entity	6	10	Wikidata + LinkedIn present; missing Wikipedia, Crunchbase, about page
Negative Penalty	−3	—	Hidden text ("CtrlK"), keyword stuffing (github 2.8%)

Trust stack: Grade C / Medium — Consistency 4/5, Social 4/5, Identity 3/5, Technical 3/5, Academic 1/5
Platform scores: Google AI 80, Perplexity 65, ChatGPT 50
WebMCP readiness: Basic — SearchAction potential action detected
CDN note: AI crawlers receive HTTP 301 redirect; content accessible after redirect

📄 Sitemap Page Scores (20 of 188 pages audited)

URL	Score	Band
`/` (homepage)	42	Foundation
`/blog`	33	Critical
`/blog/3`, `/4`, `/5`, `/8`	33	Critical
`/blog/2`, `/6`	32	Critical
`/blog/2026-01-13-...-continuous-improvement`	33	Critical
`/blog/2026-01-13-...-continuous-refactoring`	32	Critical
Most blog posts	30	Critical
`/agent-factory-status`	27	Critical

Pattern: All non-homepage pages score 0 on Robots.txt, llms.txt, Schema, and AI Discovery. Meta tags are consistently 14/14. The site-wide average of 31.65 will improve dramatically once robots.txt and llms.txt are added at the root.

Sitemap size: 188 URLs discovered; only 20 audited (cap hit). Use --max-urls to expand coverage.

Automated audit powered by geo-optimizer-skill · Run logs

Generated by 🌍 GEO Optimizer Daily Audit · ◷

expires on Jun 20, 2026, 9:30 AM UTC-08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[geo-optimizer] GEO Audit Report — 2026-06-17 #39857

Uh oh!

{{title}}

Uh oh!

README (`github.com/github/gh-aw`) — 55/100 Foundation

Docs Site (`github.github.com/gh-aw/`) — 42/100 Foundation

Replies: 0 comments

Select a reply

Uh oh!

[geo-optimizer] GEO Audit Report — 2026-06-17 #39857

Uh oh!

github-actions[bot] Bot Jun 17, 2026

GEO Audit Report — github/gh-aw

📊 Scores

✅ Top Strengths

🚨 Critical Gaps

🔧 Recommended Fixes

README (github.com/github/gh-aw) — 55/100 Foundation

Docs Site (github.github.com/gh-aw/) — 42/100 Foundation

Replies: 0 comments

github-actions[bot]
Bot Jun 17, 2026

README (`github.com/github/gh-aw`) — 55/100 Foundation

Docs Site (`github.github.com/gh-aw/`) — 42/100 Foundation