You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
All AI crawlers unblocked — Neither site blocks any AI bot via CDN challenge or robots.txt Disallow.
🚨 Critical Gaps
Docs site has no robots.txt — Zero AI bots are explicitly allowed; this costs 18/18 points on every single page across 188 URLs in the sitemap.
Docs site has no llms.txt — AI engines cannot discover structured site context; another 18/18 points lost site-wide.
Both sites score 0/6 on AI Discovery — No /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, or /ai/service.json exist on either domain.
README has zero JSON-LD schema — No WebSite, WebApplication, or Organization markup (0/16); this is the highest-weight category missing from the repo homepage.
Sitemap blog pages are uniformly Critical (avg 31.65) — 19 of 20 audited pages score in the Critical band, all with 0 on robots, llms, schema, and ai_discovery.
🔧 Recommended Fixes
Ordered by estimated point impact:
[Docs site] Add robots.txt with explicit Allow: / for GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and similar AI crawlers. Single file, affects all 188 URLs. (+18 pts on every page)
[Docs site] Generate llms.txt via geo llms --base-url https://github.github.com/gh-aw. Provides structured AI indexing context. (+up to 18 pts)
[README] Add JSON-LD schema markup — At minimum: WebSite + Organization (with sameAs links to LinkedIn, Wikidata). This is a GitHub-controlled limitation but can be added via the repo description or a GitHub Pages site for project docs. (+up to 16 pts)
[README] Add <link rel="canonical"> — Prevents duplicate content issues in AI indexing (currently missing, -3 pts on meta).
[README] Add freshness signals — Add dateModified schema and an RSS/Atom <link> in <head>. (+3 pts on signals)
[Docs site] Front-load key information — Move the most important content (what gh-aw is, key capabilities) into the first 30% of homepage body. Improves AI snippet selection. (context efficiency: 65/100 → higher)
[Docs site blog] Add per-article schema — Add BlogPosting or Article JSON-LD to all blog entries (currently 0 schema across 19 critical pages).
[Both sites] Fix keyword density — "issue" at 23.2% and "github" at 2.8% density flagged as stuffing; diversify vocabulary.
[README] Add contact/social sameAs links to Organization schema — Currently 0 Knowledge Graph pillars; add LinkedIn, Wikidata, Wikipedia links for entity disambiguation.
RAG chunk readiness: Docs site 45/100, README 46/100 — sections are too short (avg 6.5–23.8 words); expand with more prose per section.
📄 Sitemap Page Scores (20 of 188 audited)
URL
Score
Band
/ (homepage)
42
Foundation
/blog
33
Critical
/blog/3–5, 8
33
Critical
/blog/2, 6, 16
32
Critical
/blog/7, 9
30
Critical
/blog/2026-01-12-welcome-to-pelis-agent-factory
30
Critical
/blog/2026-01-13-meet-the-workflows-* (7 posts)
30
Critical
/blog/2026-01-13-...-continuous-improvement
33
Critical
/agent-factory-status
27
Critical (worst)
Consistent pattern: All non-homepage pages score 0 on robots, llms, schema, and ai_discovery. Meta tags (14/14) and signals (5/6) are the only positive contributors. Adding robots.txt + llms.txt + article schema would lift the entire site.
Truncation warning: Sitemap has 188 URLs; only 20 were audited. Run with --max-urls to increase coverage.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
GEO Audit Report — github/gh-aw
Audit Date: 2026-06-15
Run: §27565878168
📊 Scores
github.github.com/gh-aw/)github.com/github/gh-aw)✅ Top Strengths
robots.txtexplicitly allows GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and 24 others (robots score: 15/18).llms.txtpresent — 2,919 words, 12 sections, 117 links; solid structured discovery file (llms score: 14/18).WebSite,Organization,SoftwareApplication, andFAQPagetypes (schema score: 12/16)./blog/rss.xml, freshness date2026-05-09, andlang=enall present.robots.txtDisallow.🚨 Critical Gaps
robots.txt— Zero AI bots are explicitly allowed; this costs 18/18 points on every single page across 188 URLs in the sitemap.llms.txt— AI engines cannot discover structured site context; another 18/18 points lost site-wide./.well-known/ai.txt,/ai/summary.json,/ai/faq.json, or/ai/service.jsonexist on either domain.WebSite,WebApplication, orOrganizationmarkup (0/16); this is the highest-weight category missing from the repo homepage.🔧 Recommended Fixes
Ordered by estimated point impact:
robots.txtwith explicitAllow: /forGPTBot,ClaudeBot,PerplexityBot,Google-Extended, and similar AI crawlers. Single file, affects all 188 URLs. (+18 pts on every page)llms.txtviageo llms --base-url https://github.github.com/gh-aw. Provides structured AI indexing context. (+up to 18 pts)/.well-known/ai.txt,/ai/summary.json,/ai/faq.json,/ai/service.json. Zero-cost structured signals. (+6 pts each)WebSite+Organization(withsameAslinks to LinkedIn, Wikidata). This is a GitHub-controlled limitation but can be added via the repo description or a GitHub Pages site for project docs. (+up to 16 pts)<link rel="canonical">— Prevents duplicate content issues in AI indexing (currently missing, -3 pts on meta).dateModifiedschema and an RSS/Atom<link>in<head>. (+3 pts on signals)BlogPostingorArticleJSON-LD to all blog entries (currently 0 schema across 19 critical pages)."issue"at 23.2% and"github"at 2.8% density flagged as stuffing; diversify vocabulary.📋 Full Breakdown by Category
Docs Site (
github.github.com/gh-aw/)Trust Stack: Technical 3/5, Identity 3/5, Social 4/5, Academic 1/5, Consistency 4/5 → Grade C
Platform citation scores: ChatGPT 50, Perplexity 65, Google AI 80
README (
github.com/github/gh-aw)llms-full.txtlang=enonly; no RSS, no freshness dateissue23.2%), 12 broken linksTrust Stack: Technical 5/5, Identity 3/5, Social 1/5, Academic 4/5, Consistency 3/5 → Grade C
Platform citation scores: ChatGPT 55, Perplexity 70, Google AI 48
Negative signals: Hidden text (
display:nonenav), keyword stuffing, 12 broken links detected.RAG chunk readiness: Docs site 45/100, README 46/100 — sections are too short (avg 6.5–23.8 words); expand with more prose per section.
📄 Sitemap Page Scores (20 of 188 audited)
/(homepage)/blog/blog/3–5, 8/blog/2, 6, 16/blog/7, 9/blog/2026-01-12-welcome-to-pelis-agent-factory/blog/2026-01-13-meet-the-workflows-*(7 posts)/blog/2026-01-13-...-continuous-improvement/agent-factory-statusConsistent pattern: All non-homepage pages score 0 on robots, llms, schema, and ai_discovery. Meta tags (14/14) and signals (5/6) are the only positive contributors. Adding
robots.txt+llms.txt+ article schema would lift the entire site.Truncation warning: Sitemap has 188 URLs; only 20 were audited. Run with
--max-urlsto increase coverage.Automated audit powered by geo-optimizer-skill · Run logs
Beta Was this translation helpful? Give feedback.
All reactions