You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
10 Critical, 10 Foundation — robots/llms/ai_discovery zero sitewide
✅ Top Strengths
README — AI bot access: All major bots explicitly allowed in robots.txt — GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Anthropic, Cohere, xAI-Bot, and 20+ more. citation_bots_ok: true.
README — llms.txt quality: 14/18 — found at root, 12 sections, 117 links, 2,919 words, includes blockquote. Well-structured for LLM indexers.
Docs site — Meta tags: Perfect 14/14 — title, description, canonical URL, and full Open Graph tags all present.
Docs site — Schema markup: 12/16 — WebSite, Organization, SoftwareApplication, and FAQPage JSON-LD all present.
Docs site — Signals: Perfect 6/6 — RSS feed linked, freshness date 2026-05-09 present.
Docs site — Brand/entity: 7/10 — Knowledge Graph pillars confirmed: LinkedIn, Wikidata, Wikipedia, Crunchbase all linked via sameAs.
Docs site — Google AI citation score: 80/100 (vs 48 for README). Strongest platform for the docs site.
🚨 Critical Gaps
Docs site: No robots.txt — scores 0/18 on every single audited page (all 20). AI crawlers (GPTBot, ClaudeBot, PerplexityBot) have no explicit permissions for github.github.com/gh-aw/. This is the single highest-impact gap.
Docs site: No llms.txt — scores 0/18 sitewide. The structured LLM index file that performs well on the README is entirely absent from the docs site.
Both sites: No AI discovery endpoints — both score 0/6 for ai_discovery. Missing: /.well-known/ai.txt, /ai/summary.json, /ai/faq.json, /ai/service.json.
README: No JSON-LD schema — 0/16 (the docs site scores 12/16). No WebSite, Organization, or FAQPage structured data. Note: adding schema to GitHub.com pages requires GitHub platform support.
Keyword stuffing on README: 'issue' detected at 22.7% density — contributes to the -3 negative penalty and lowers AI trust signals.
🔧 Recommended Fixes
Ordered by estimated impact (points gained across the site):
Priority
Fix
Est. Impact
Applies To
🔴 1
Add robots.txt to docs site root allowing GPTBot, ClaudeBot, PerplexityBot
+18 pts/page × all pages
Docs site
🔴 2
Add llms.txt to docs site (/llms.txt + optionally /llms-full.txt)
Paginated blog index pages (/blog/2–/blog/9) all score 30–33 with 0 schema, low brand entity, and a negative penalty. Adding per-page schema and reducing duplicate/thin content would help these most.
Total sitemap: 191 URLs discovered, 20 audited. Average 37/100. Band distribution: 10 Foundation, 10 Critical.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
GEO Audit Report — github/gh-aw
Audit Date: 2026-06-23
Run: §28043137890
📊 Scores
github.com/github/gh-aw)github.github.com/gh-aw/)✅ Top Strengths
robots.txt— GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Anthropic, Cohere, xAI-Bot, and 20+ more.citation_bots_ok: true.WebSite,Organization,SoftwareApplication, andFAQPageJSON-LD all present.2026-05-09present.sameAs.🚨 Critical Gaps
robots.txt— scores 0/18 on every single audited page (all 20). AI crawlers (GPTBot, ClaudeBot, PerplexityBot) have no explicit permissions forgithub.github.com/gh-aw/. This is the single highest-impact gap.llms.txt— scores 0/18 sitewide. The structured LLM index file that performs well on the README is entirely absent from the docs site.ai_discovery. Missing:/.well-known/ai.txt,/ai/summary.json,/ai/faq.json,/ai/service.json.WebSite,Organization, orFAQPagestructured data. Note: adding schema to GitHub.com pages requires GitHub platform support.'issue'detected at 22.7% density — contributes to the -3 negative penalty and lowers AI trust signals.🔧 Recommended Fixes
Ordered by estimated impact (points gained across the site):
robots.txtto docs site root allowing GPTBot, ClaudeBot, PerplexityBotllms.txtto docs site (/llms.txt+ optionally/llms-full.txt)/.well-known/ai.txt,/ai/summary.json,/ai/faq.json,/ai/service.jsonVideoObjectschema for the 2 videos on the docs homepage (name, description, thumbnailUrl, uploadDate)'issue'keyword density from 22.7%sameAslinks (Wikipedia, Wikidata, LinkedIn, Crunchbase) to README Organization schemadateModifiedschema to README to signal content freshness (content decay risk: high)descriptionwith metadescriptionon both sites (schema_desc_matches_meta: false)📋 Full Category Breakdown
README (
github.com/github/gh-aw) — 55/100Allow:directives, implicit)llms-full.txt, no optional sectionTrust stack: Technical 5/5 ✅ · Identity 3/5 · Social 1/5 · Academic 4/5 · Consistency 3/5 → Grade C (medium)
Platform citation scores: ChatGPT 55 · Perplexity 70 · Google AI 48
Negative signals: Hidden text (nav elements),
'issue'keyword stuffing at 22.7%, 12 broken linksRAG chunk readiness: 46/100 — avg section only 23.8 words, low chunk density
Docs Site (
github.github.com/gh-aw/) — 43/100Trust stack: Technical 3/5 · Identity 3/5 · Social 4/5 ✅ · Academic 1/5 · Consistency 4/5 → Grade C (medium)
Platform citation scores: ChatGPT 50 · Perplexity 65 · Google AI 80 ✅
Multimodal: 2 videos detected, no VideoObject schema — opportunity for rich snippet
Content decay risk: Low (evergreen score: 100)
📄 Sitemap Page Scores (20 of 191 pages audited)
Consistent pattern across all 20 pages: Robots=0, llms=0, AI Discovery=0 — adding
robots.txtandllms.txtwould immediately lift all pages.Top 5 Pages
/blog/2026-01-13-meet-the-workflows-continuous-improvement/(homepage)/blog/2026-01-13-meet-the-workflows-continuous-refactoring/blog/2026-01-12-welcome-to-pelis-agent-factory/blog/2026-01-13-meet-the-workflows-advanced-analyticsBottom 5 Pages
/blog/7/blog/6/blog/9/blog/5/blog/8Paginated blog index pages (
/blog/2–/blog/9) all score 30–33 with 0 schema, low brand entity, and a negative penalty. Adding per-page schema and reducing duplicate/thin content would help these most.Total sitemap: 191 URLs discovered, 20 audited. Average 37/100. Band distribution: 10 Foundation, 10 Critical.
Automated audit powered by geo-optimizer-skill · Run logs
Beta Was this translation helpful? Give feedback.
All reactions