[geo-optimizer] GEO Audit Report — 2026-06-15 #39436

2026-06-15T18:08:06Z

github-actions[bot]
Bot Jun 15, 2026

GEO Audit Report — github/gh-aw

Audit Date: 2026-06-15
Run: §27565878168

📊 Scores

Target	Score	Band
Docs site (`github.github.com/gh-aw/`)	42/100	Foundation
README (`github.com/github/gh-aw`)	55/100	Foundation
Sitemap avg (20/188 pages audited)	31.65/100	Critical

✅ Top Strengths

README: All major AI bots permitted — GitHub's robots.txt explicitly allows GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and 24 others (robots score: 15/18).
README: llms.txt present — 2,919 words, 12 sections, 117 links; solid structured discovery file (llms score: 14/18).
Docs site: Perfect meta tags (14/14) — Title, description, canonical, and all Open Graph tags are fully populated.
Docs site: Rich structured data — JSON-LD schema includes WebSite, Organization, SoftwareApplication, and FAQPage types (schema score: 12/16).
Docs site: Full signals score (6/6) — RSS feed at /blog/rss.xml, freshness date 2026-05-09, and lang=en all present.
README: Perfect content score (12/12) — 3,006 words, 42 headings, structured lists and tables, numbers/statistics cited.
All AI crawlers unblocked — Neither site blocks any AI bot via CDN challenge or robots.txt Disallow.

🚨 Critical Gaps

Docs site has no robots.txt — Zero AI bots are explicitly allowed; this costs 18/18 points on every single page across 188 URLs in the sitemap.
Docs site has no llms.txt — AI engines cannot discover structured site context; another 18/18 points lost site-wide.
Both sites score 0/6 on AI Discovery — No /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, or /ai/service.json exist on either domain.
README has zero JSON-LD schema — No WebSite, WebApplication, or Organization markup (0/16); this is the highest-weight category missing from the repo homepage.
Sitemap blog pages are uniformly Critical (avg 31.65) — 19 of 20 audited pages score in the Critical band, all with 0 on robots, llms, schema, and ai_discovery.

🔧 Recommended Fixes

Ordered by estimated point impact:

[Docs site] Add robots.txt with explicit Allow: / for GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and similar AI crawlers. Single file, affects all 188 URLs. (+18 pts on every page)
[Docs site] Generate llms.txt via geo llms --base-url https://github.github.com/gh-aw. Provides structured AI indexing context. (+up to 18 pts)
[Both sites] Create AI Discovery endpoints — /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, /ai/service.json. Zero-cost structured signals. (+6 pts each)
[README] Add JSON-LD schema markup — At minimum: WebSite + Organization (with sameAs links to LinkedIn, Wikidata). This is a GitHub-controlled limitation but can be added via the repo description or a GitHub Pages site for project docs. (+up to 16 pts)
[README] Add <link rel="canonical"> — Prevents duplicate content issues in AI indexing (currently missing, -3 pts on meta).
[README] Add freshness signals — Add dateModified schema and an RSS/Atom <link> in <head>. (+3 pts on signals)
[Docs site] Front-load key information — Move the most important content (what gh-aw is, key capabilities) into the first 30% of homepage body. Improves AI snippet selection. (context efficiency: 65/100 → higher)
[Docs site blog] Add per-article schema — Add BlogPosting or Article JSON-LD to all blog entries (currently 0 schema across 19 critical pages).
[Both sites] Fix keyword density — "issue" at 23.2% and "github" at 2.8% density flagged as stuffing; diversify vocabulary.
[README] Add contact/social sameAs links to Organization schema — Currently 0 Knowledge Graph pillars; add LinkedIn, Wikidata, Wikipedia links for entity disambiguation.

📋 Full Breakdown by Category

Docs Site (`github.github.com/gh-aw/`)

Category	Score	Max	Notes
Robots.txt	0	18	❌ File not found
llms.txt	0	18	❌ File not found
Schema JSON-LD	12	16	✅ WebSite, Organization, SoftwareApplication, FAQPage
Meta Tags	14	14	✅ Perfect
Content	7	12	⚠️ 921 words, no numbers/stats, not front-loaded
Signals	6	6	✅ RSS + freshness date present
AI Discovery	0	6	❌ No .well-known/ai.txt or /ai/*.json
Brand/Entity	6	10	⚠️ Has Wikidata + LinkedIn; no Wikipedia/Crunchbase/about page
Negative penalty	-3	—	Hidden text, keyword stuffing, video without VideoObject schema
Total	42	100	Foundation

Trust Stack: Technical 3/5, Identity 3/5, Social 4/5, Academic 1/5, Consistency 4/5 → Grade C

Platform citation scores: ChatGPT 50, Perplexity 65, Google AI 80

README (`github.com/github/gh-aw`)

Category	Score	Max	Notes
Robots.txt	15	18	✅ All major AI bots allowed (no explicit citation-bot declarations)
llms.txt	14	18	✅ Present, 2,919 words, 12 sections, 117 links; no `llms-full.txt`
Schema JSON-LD	0	16	❌ No schema markup at all
Meta Tags	11	14	⚠️ Missing canonical URL
Content	12	12	✅ Perfect — 3,006 words, 42 headings, lists, numbers
Signals	3	6	⚠️ `lang=en` only; no RSS, no freshness date
AI Discovery	0	6	❌ No .well-known/ai.txt or /ai/*.json
Brand/Entity	3	10	⚠️ Brand consistent; no KG pillars, no contact info, no sameAs
Negative penalty	-3	—	Hidden nav text, keyword stuffing (`issue` 23.2%), 12 broken links
Total	55	100	Foundation

Trust Stack: Technical 5/5, Identity 3/5, Social 1/5, Academic 4/5, Consistency 3/5 → Grade C

Platform citation scores: ChatGPT 55, Perplexity 70, Google AI 48

Negative signals: Hidden text (display:none nav), keyword stuffing, 12 broken links detected.

RAG chunk readiness: Docs site 45/100, README 46/100 — sections are too short (avg 6.5–23.8 words); expand with more prose per section.

📄 Sitemap Page Scores (20 of 188 audited)

URL	Score	Band
`/` (homepage)	42	Foundation
`/blog`	33	Critical
`/blog/3–5, 8`	33	Critical
`/blog/2, 6, 16`	32	Critical
`/blog/7, 9`	30	Critical
`/blog/2026-01-12-welcome-to-pelis-agent-factory`	30	Critical
`/blog/2026-01-13-meet-the-workflows-*` (7 posts)	30	Critical
`/blog/2026-01-13-...-continuous-improvement`	33	Critical
`/agent-factory-status`	27	Critical (worst)

Consistent pattern: All non-homepage pages score 0 on robots, llms, schema, and ai_discovery. Meta tags (14/14) and signals (5/6) are the only positive contributors. Adding robots.txt + llms.txt + article schema would lift the entire site.

Truncation warning: Sitemap has 188 URLs; only 20 were audited. Run with --max-urls to increase coverage.

Automated audit powered by geo-optimizer-skill · Run logs

Generated by 🌍 GEO Optimizer Daily Audit · 76.2 AIC · ⌖ 13.7 AIC · ⊞ 16.1K · ◷

expires on Jun 18, 2026, 10:08 AM UTC-08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[geo-optimizer] GEO Audit Report — 2026-06-15 #39436

Uh oh!

{{title}}

Uh oh!

Docs Site (`github.github.com/gh-aw/`)

README (`github.com/github/gh-aw`)

Replies: 0 comments

Select a reply

Uh oh!

[geo-optimizer] GEO Audit Report — 2026-06-15 #39436

Uh oh!

github-actions[bot] Bot Jun 15, 2026

GEO Audit Report — github/gh-aw

📊 Scores

✅ Top Strengths

🚨 Critical Gaps

🔧 Recommended Fixes

Docs Site (github.github.com/gh-aw/)

README (github.com/github/gh-aw)

Replies: 0 comments

github-actions[bot]
Bot Jun 15, 2026

Docs Site (`github.github.com/gh-aw/`)

README (`github.com/github/gh-aw`)