[geo-optimizer] GEO Audit Report — 2026-06-23 #41085

2026-06-23T17:14:11Z

github-actions[bot]
Bot Jun 23, 2026

GEO Audit Report — github/gh-aw

Audit Date: 2026-06-23
Run: §28043137890

📊 Scores

Target	Score	Band	Notes
README (`github.com/github/gh-aw`)	55/100	Foundation	AI bots allowed, llms.txt present, no schema
Docs site (`github.github.com/gh-aw/`)	43/100	Foundation	Great meta+schema, no robots.txt or llms.txt
Sitemap average (20/191 pages audited)	37/100	Foundation	10 Critical, 10 Foundation — robots/llms/ai_discovery zero sitewide

✅ Top Strengths

README — AI bot access: All major bots explicitly allowed in robots.txt — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Anthropic, Cohere, xAI-Bot, and 20+ more. citation_bots_ok: true.
README — llms.txt quality: 14/18 — found at root, 12 sections, 117 links, 2,919 words, includes blockquote. Well-structured for LLM indexers.
README — Content richness: Perfect 12/12 — 42 headings, 2,924 words, heading hierarchy, lists/tables, front-loaded key info.
Docs site — Meta tags: Perfect 14/14 — title, description, canonical URL, and full Open Graph tags all present.
Docs site — Schema markup: 12/16 — WebSite, Organization, SoftwareApplication, and FAQPage JSON-LD all present.
Docs site — Signals: Perfect 6/6 — RSS feed linked, freshness date 2026-05-09 present.
Docs site — Brand/entity: 7/10 — Knowledge Graph pillars confirmed: LinkedIn, Wikidata, Wikipedia, Crunchbase all linked via sameAs.
Docs site — Google AI citation score: 80/100 (vs 48 for README). Strongest platform for the docs site.

🚨 Critical Gaps

Docs site: No robots.txt — scores 0/18 on every single audited page (all 20). AI crawlers (GPTBot, ClaudeBot, PerplexityBot) have no explicit permissions for github.github.com/gh-aw/. This is the single highest-impact gap.
Docs site: No llms.txt — scores 0/18 sitewide. The structured LLM index file that performs well on the README is entirely absent from the docs site.
Both sites: No AI discovery endpoints — both score 0/6 for ai_discovery. Missing: /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, /ai/service.json.
README: No JSON-LD schema — 0/16 (the docs site scores 12/16). No WebSite, Organization, or FAQPage structured data. Note: adding schema to GitHub.com pages requires GitHub platform support.
Keyword stuffing on README: 'issue' detected at 22.7% density — contributes to the -3 negative penalty and lowers AI trust signals.

🔧 Recommended Fixes

Ordered by estimated impact (points gained across the site):

Priority	Fix	Est. Impact	Applies To
🔴 1	Add `robots.txt` to docs site root allowing GPTBot, ClaudeBot, PerplexityBot	+18 pts/page × all pages	Docs site
🔴 2	Add `llms.txt` to docs site (`/llms.txt` + optionally `/llms-full.txt`)	+18 pts/page	Docs site
🟠 3	Create `/.well-known/ai.txt`, `/ai/summary.json`, `/ai/faq.json`, `/ai/service.json`	+6 pts/page	Both sites
🟠 4	Add `VideoObject` schema for the 2 videos on the docs homepage (name, description, thumbnailUrl, uploadDate)	Quality signal	Docs site
🟠 5	Add numerical statistics/concrete metrics to docs site homepage content (currently 0 numbers detected)	Content score lift	Docs site
🟡 6	Diversify vocabulary in README to reduce `'issue'` keyword density from 22.7%	Remove -1 penalty	README
🟡 7	Add `sameAs` links (Wikipedia, Wikidata, LinkedIn, Crunchbase) to README Organization schema	Brand entity lift	README
🟡 8	Investigate and fix the 12 broken links on the README page (penalizes trust score)	Trust improvement	README
🟢 9	Add `dateModified` schema to README to signal content freshness (content decay risk: high)	Freshness signal	README
🟢 10	Align schema `description` with meta `description` on both sites (`schema_desc_matches_meta: false`)	Consistency trust	Both

📋 Full Category Breakdown

README (`github.com/github/gh-aw`) — 55/100

Category	Score	Max	Notes
Robots.txt	15	18	All major AI bots allowed; not explicit (no `Allow:` directives, implicit)
llms.txt	14	18	Present, 12 sections, 117 links, no `llms-full.txt`, no optional section
Schema JSON-LD	0	16	None — no WebSite, Organization, or FAQPage
Meta Tags	11	14	No canonical URL
Content	12	12	✅ Perfect
Signals	3	6	No RSS, no freshness date
AI Discovery	0	6	No ai.txt, no /ai/*.json endpoints
Brand & Entity	3	10	No KG pillars, no contact info, no sameAs
Negative Penalty	−3	—	Hidden text, keyword stuffing, popup signals

Trust stack: Technical 5/5 ✅ · Identity 3/5 · Social 1/5 · Academic 4/5 · Consistency 3/5 → Grade C (medium)

Platform citation scores: ChatGPT 55 · Perplexity 70 · Google AI 48

Negative signals: Hidden text (nav elements), 'issue' keyword stuffing at 22.7%, 12 broken links

RAG chunk readiness: 46/100 — avg section only 23.8 words, low chunk density

Docs Site (`github.github.com/gh-aw/`) — 43/100

Category	Score	Max	Notes
Robots.txt	0	18	Missing entirely
llms.txt	0	18	Missing entirely
Schema JSON-LD	12	16	WebSite ✅, Organization ✅, SoftwareApplication ✅, FAQPage ✅
Meta Tags	14	14	✅ Perfect — title, desc, canonical, OG all present
Content	7	12	923 words, no numbers/stats, no front-loading
Signals	6	6	✅ Perfect — RSS + freshness date
AI Discovery	0	6	No ai.txt, no /ai/*.json endpoints
Brand & Entity	7	10	KG pillars: 4/4 ✅, but no about page link, no author
Negative Penalty	−3	—	Hidden text, keyword stuffing ('github' 2.8%)

Trust stack: Technical 3/5 · Identity 3/5 · Social 4/5 ✅ · Academic 1/5 · Consistency 4/5 → Grade C (medium)

Platform citation scores: ChatGPT 50 · Perplexity 65 · Google AI 80 ✅

Multimodal: 2 videos detected, no VideoObject schema — opportunity for rich snippet

Content decay risk: Low (evergreen score: 100)

📄 Sitemap Page Scores (20 of 191 pages audited)

Consistent pattern across all 20 pages: Robots=0, llms=0, AI Discovery=0 — adding robots.txt and llms.txt would immediately lift all pages.

Top 5 Pages

URL	Score	Band
`/blog/2026-01-13-meet-the-workflows-continuous-improvement`	44	Foundation
`/` (homepage)	43	Foundation
`/blog/2026-01-13-meet-the-workflows-continuous-refactoring`	43	Foundation
`/blog/2026-01-12-welcome-to-pelis-agent-factory`	41	Foundation
`/blog/2026-01-13-meet-the-workflows-advanced-analytics`	41	Foundation

Bottom 5 Pages

URL	Score	Band
`/blog/7`	30	Critical
`/blog/6`	32	Critical
`/blog/9`	32	Critical
`/blog/5`	33	Critical
`/blog/8`	33	Critical

Paginated blog index pages (/blog/2–/blog/9) all score 30–33 with 0 schema, low brand entity, and a negative penalty. Adding per-page schema and reducing duplicate/thin content would help these most.

Total sitemap: 191 URLs discovered, 20 audited. Average 37/100. Band distribution: 10 Foundation, 10 Critical.

Automated audit powered by geo-optimizer-skill · Run logs

Generated by 🌍 GEO Optimizer Daily Audit · 28.2 AIC · ⌖ 11 AIC · ⊞ 4.5K · ◷

expires on Jun 26, 2026, 9:14 AM UTC-08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[geo-optimizer] GEO Audit Report — 2026-06-23 #41085

Uh oh!

{{title}}

Uh oh!

README (`github.com/github/gh-aw`) — 55/100

Docs Site (`github.github.com/gh-aw/`) — 43/100

Top 5 Pages

Bottom 5 Pages

Replies: 0 comments

Select a reply

Uh oh!

[geo-optimizer] GEO Audit Report — 2026-06-23 #41085

Uh oh!

github-actions[bot] Bot Jun 23, 2026

GEO Audit Report — github/gh-aw

📊 Scores

✅ Top Strengths

🚨 Critical Gaps

🔧 Recommended Fixes

README (github.com/github/gh-aw) — 55/100

Docs Site (github.github.com/gh-aw/) — 43/100

Top 5 Pages

Bottom 5 Pages

Replies: 0 comments

github-actions[bot]
Bot Jun 23, 2026

README (`github.com/github/gh-aw`) — 55/100

Docs Site (`github.github.com/gh-aw/`) — 43/100