╔══════════════════════════════════════════════════════════════╗
║ ║
║ ▗▄▄▄ ▗▄▖ ▗▄▄▖▗▖ ▗▖▗▖ ▗▖▗▄▄▄▖▗▖ ▗▖▗▄▄▄▖▗▄▖ ▗▄▄▖ ║
║ ▐▌ █▐▌ ▐▌▐▌ ▐▌ ▐▌▐▛▚▞▜▌▐▌ ▐▛▚▖▐▌ █ ▐▌ ▐▌▐▌ ▐▌ ║
║ ▐▌ █▐▌ ▐▌▐▌ ▐▌ ▐▌▐▌ ▐▌▐▛▀▀▘▐▌ ▝▜▌ █ ▐▌ ▐▌▐▛▀▚▖ ║
║ ▐▙▄▄▀▝▚▄▞▘▝▚▄▄▖▝▚▄▞▘▐▌ ▐▌▐▙▄▄▖▐▌ ▐▌ █ ▝▚▄▞▘▐▌ ▐▌ ║
║ ║
║ 📚 ────→ 🕷️ ────→ 📖 ────→ 🔍 ────→ 🤖 ───→ 🎉 ║
║ Docs Spider Aggregate Index AI Assistants ║
║ ║
╚══════════════════════════════════════════════════════════════╝
A repository for managing and aggregating external documentation for use with AI assistants and LLM-powered tools.
This repository contains tools and documentation for spidering, aggregating, and indexing external documentation sites. The aggregated documentation is optimized for consumption by AI assistants like Composer, Claude, and other LLM-powered tools.
documentor/
├── docs/
│ ├── external/ # Aggregated documentation files (organized by source)
│ └── guide/ # Comprehensive guides and documentation
├── tools/
│ └── doc-aggregator/ # Tool for spidering documentation sites
└── prompts/ # Prompts for AI assistants
For a complete list of available documentation, see the Documentation Guide.
This repository serves as a centralized location for:
- Documentation Aggregation: Using the doc-aggregator tool to spider and consolidate external documentation sites
- Documentation Storage: Storing aggregated documentation in a structured format optimized for LLM consumption
- Index Creation: Creating grep indexes and other search indices for efficient document retrieval
- AI Assistant Integration: Providing documentation context for AI assistants working on related projects
All aggregated documentation is stored in docs/external/ organized by source. Each documentation set typically includes:
- Main Documentation Files: .md or .txt files containing the full aggregated content
- Index Files: *-index.md files containing structured indexes for quick reference
- Progress Files: *-progress.json files tracking crawl progress (used by the aggregator tool)
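The naming conventions above can be checked mechanically. The following is a minimal sketch, not part of the doc-aggregator tool: the function name classifyDocFile and the category labels are hypothetical, and only the file suffixes come from the list above.

```javascript
// Hypothetical helper: classify a file found under docs/external/ by its name.
// Only the suffixes are taken from this README; the category labels are assumptions.
function classifyDocFile(fileName) {
  // Check the more specific suffixes first, since "-index.md" also ends with ".md".
  if (fileName.endsWith("-progress.json")) return "progress";
  if (fileName.endsWith("-index.md")) return "index";
  if (fileName.endsWith(".md") || fileName.endsWith(".txt")) return "main";
  return "other";
}

// Example file names are made up for illustration.
for (const name of ["example.md", "example-index.md", "example-progress.json"]) {
  console.log(`${name}: ${classifyDocFile(name)}`);
}
```

The order of the checks matters: testing for .md first would label every index file as a main documentation file.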
See the Documentation Guide for a complete catalog of available documentation.
The tools/doc-aggregator/ directory contains a Node.js tool for spidering documentation sites. See tools/doc-aggregator/README.md for detailed usage instructions.
Quick Start:
- Edit tools/doc-aggregator/config.json to add documentation sites
- Run node tools/doc-aggregator/run-direct.js
- Generated files appear in docs/external/
To add documentation for a new site:
- Configure: Add a new site entry to tools/doc-aggregator/config.json
- Crawl: Run the doc-aggregator tool to spider the site
- Index: Create a grep index using the prompt in prompts/ (see below)
- Organize: Place files in appropriate subdirectories under docs/external/
After aggregating documentation, create a grep index to enable efficient search. See prompts/create-grep-index.md for instructions on how to use Composer to generate accurate grep indexes for any documentation file.
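To illustrate what a grep-friendly index might contain, here is a rough sketch that lists each Markdown heading of an aggregated file with its line number. This is an assumption about the index shape, not the format produced by prompts/create-grep-index.md; the function names buildHeadingIndex and formatIndex are hypothetical.

```javascript
// Hypothetical sketch: collect Markdown headings and their 1-based line numbers.
// Lines inside fenced code blocks are skipped so "#" comments are not mistaken for headings.
const FENCE = "`".repeat(3);

function buildHeadingIndex(text) {
  const entries = [];
  let inFence = false;
  text.split(/\r?\n/).forEach((line, i) => {
    if (line.trimStart().startsWith(FENCE)) {
      inFence = !inFence; // toggle on every opening or closing fence
      return;
    }
    if (inFence) return;
    const match = /^(#{1,6})\s+(.+?)\s*$/.exec(line);
    if (match) {
      entries.push({ line: i + 1, level: match[1].length, title: match[2] });
    }
  });
  return entries;
}

// Render the entries as a nested list, indented by heading level.
function formatIndex(entries) {
  return entries
    .map((e) => `${"  ".repeat(e.level - 1)}- ${e.title} (line ${e.line})`)
    .join("\n");
}

console.log(formatIndex(buildHeadingIndex("# Guide\n\nIntro text\n\n## Setup\n")));
```

Recording line numbers lets an assistant jump straight to the relevant part of a large aggregated file instead of reading it in full.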
The doc-aggregator tool supports multiple update strategies, including automatic change detection. By default, it uses smart update mode, which automatically detects and updates changed content.
Quick Update Options:
- Automatic Updates: Run node fetch-docs.js --update to check for changes (recommended)
- Force Update: Run node fetch-docs.js --force for a complete re-crawl
- Configure Updates: Set updateMode and reCrawlAfterDays in config.json for automatic updates
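As a rough illustration of how an age-based setting such as reCrawlAfterDays could be applied, the sketch below decides whether a site is due for a re-crawl. The semantics are an assumption (a site is due once that many days have passed since its last crawl), and the function name isRecrawlDue and the lastCrawledAt timestamp are hypothetical; see the Update Behavior Guide for what the tool actually does.

```javascript
const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Hypothetical check, assuming reCrawlAfterDays means "re-crawl once the last
// crawl is at least this many days old". lastCrawledAt is an assumed ISO timestamp.
function isRecrawlDue(lastCrawledAt, reCrawlAfterDays, now = new Date()) {
  const last = new Date(lastCrawledAt);
  // A missing or unparseable timestamp is treated as "never crawled".
  if (Number.isNaN(last.getTime())) return true;
  return now.getTime() - last.getTime() >= reCrawlAfterDays * MS_PER_DAY;
}

// Example with made-up dates: last crawl 8 days before "now", threshold 7 days.
console.log(
  isRecrawlDue("2024-01-01T00:00:00Z", 7, new Date("2024-01-09T00:00:00Z"))
);
```

Passing the current time as a parameter keeps the check deterministic, which makes it easy to test.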
For detailed information, see:
- Documentation Guide - Complete catalog of available documentation
- Update Behavior Guide - Comprehensive guide to how updates work
- Doc Aggregator README - Tool usage and configuration
- Regular Updates: Periodically re-crawl documentation to capture updates
- Version Control: Commit documentation files to track changes over time
- Index Maintenance: Regenerate indexes when documentation is updated
- Organization: Keep related documentation in logical subdirectories
This repository contains aggregated documentation from various sources. Each source maintains its own copyright and licensing. This repository is for organizational and reference purposes only.