"Archetype: experimental. Role: Discovery tool for finding repos with AI-native documentation patterns."
A discovery tool for finding GitHub repositories that use discovery patterns in their documentation - a signal that the architect independently figured out AI-native development practices.
This isn't about evangelism or user acquisition. It's about finding the "five people on the planet" working on AI-native architecture patterns and starting conversations.
We're searching for a specific architectural approach:
- Discovery commands over static lists - Documentation that teaches through exploration (
kubectl get pods,tree -L 2,grep -r "pattern") - Pattern recognition - If someone independently adopted discovery-based documentation, they "get it"
- Production thinking - Commands that help you understand real systems, not tutorials
- Filename agnostic - Whether it's README.md, CONTRIBUTING.md, or CLAUDE.md doesn't matter
Example discovery pattern:
## Service Architecture
**Discovery**:
\```bash
# List all running resources
tilt get uiresources
# View pod status
kubectl get pods -n budget-analyzer
\```If a root markdown file contains commands like these, that architect is thinking about AI-native development.
Two-Stage Discovery:
-
Topic Pre-Filtering - Use GitHub Search API to find candidate repos by topics
- Primary:
topic:ai-native - Fallback:
topic:ai-assisted-development - Expansion:
topic:devcontainer topic:kubernetes - Reduces search space from millions to ~50 repos
- Primary:
-
Content Search - Scan pre-filtered repos for discovery patterns
- Fetch root-level markdown files
- Look for discovery command patterns (grep, kubectl, docker, tree, etc.)
- Score based on pattern depth and quality
Analysis:
- Extract contact information (public data only)
- Score "peer potential" based on production signals
- Generate registry of discoveries
# Clone the repository
git clone https://github.com/budgetanalyzer/claude-discovery.git
cd claude-discovery
# Set up environment
cp .env.example .env
# Edit .env and add your GitHub Personal Access Token
# Install dependencies
pip install -r requirements.txt
# Run discovery
python -m src.main
# View results
cat DISCOVERIES.md # Human-readable findings
cat discoveries.json # Machine-readable dataLive Site: https://budgetanalyzer.github.io/claude-discovery (once deployed)
Local Preview:
# Generate static site from discoveries
python3 src/site_generator.py
# Serve locally
cd site && python3 -m http.server 8000
# Open http://localhost:8000The registry site includes:
- Searchable discovery registry - Filter by quality, language, patterns
- Pattern library - Common discovery patterns with examples
- About page - Philosophy and principles
# Add repository to opt-out list
python3 src/opt_manager.py add owner repo "User request"
# Remove from opt-out (opt back in)
python3 src/opt_manager.py remove owner repo
# Check opt-out status
python3 src/opt_manager.py check owner repo
# Filter discoveries and regenerate site
python3 src/opt_manager.py filter
python3 src/site_generator.pySee docs/OPT-IN-OUT.md for complete opt-in/opt-out policy.
- Python 3.10+
- GitHub Personal Access Token (with
reposcope for private search,public_repofor public only)
Create a .env file with your GitHub token:
GITHUB_TOKEN=ghp_your_token_hereOptional configuration in config/search_queries.yaml:
- Topic tiers and filters
- Quality scoring weights
- Contact extraction patterns
High-potential peers have:
- Production implementations (not tutorials)
- Multiple related repositories (microservices pattern)
- CI/CD and deployment configurations
- Thoughtful documentation with discovery patterns
- Recent commits and active development
We're NOT looking for:
- Repositories that just renamed README to CLAUDE.md
- Static documentation without discovery commands
- Abandoned or tutorial-only projects
- Mass-adoption or ecosystem building
We're finding people who already figured it out independently, not convincing people to adopt a convention.
Finding 5 peer architects > cataloging 500 repos with renamed READMEs.
Only index public information. Respect robots.txt. Provide opt-out mechanisms.
This tool itself exemplifies the patterns:
- Discoverable (has its own CLAUDE.md)
- Pattern-based documentation
- Simple, bounded context
- Runnable without complex setup
We're establishing ai-native as a GitHub topic:
- Repositories built FOR and WITH AI as collaborative partner
- Discovery patterns in documentation
- Production-grade implementations
- Containerized development environments for AI agents
Dogfooding: Budget Analyzer repositories use this topic, making them discoverable by this tool.
Current Phase: Phase 3 Complete - Connection Layer
Phase 1 - MVP Discovery Engine: ✓ Complete
- Topic-based pre-filtering
- Content search for discovery patterns
- Contact extraction
- Quality scoring
- Report generation (JSON + Markdown)
- 24 repositories discovered, 3 high-quality peers identified
Phase 2 - Pattern Recognition: ✓ Complete
- Pattern detection and categorization
- Quality signal analysis
- Findings documentation
Phase 3 - Connection Layer: ✓ Complete
- GitHub Pages static site with searchable registry
- Pattern library showcasing discovered patterns
- GitHub Discussions templates for pattern sharing
- Opt-in/opt-out mechanism with privacy-first approach
- Automated site generation from discoveries
Next Steps:
- Enable GitHub Pages and Discussions (requires manual GitHub UI steps)
- Conduct outreach to high-quality peers (score 8+)
- Document conversations and learnings
- Iterate on discovery criteria based on feedback
See docs/PLAN.md for detailed roadmap and docs/DEPLOYMENT.md for deployment instructions.
Hypothesis: There are architects independently discovering that microservices, pattern-based documentation, and discovery commands enable AI-native development.
Evidence: You found it. Others will too.
Network Effect: Discovery → Connection → Learning → Ecosystem
Want to be included in the registry? Submit an opt-in request
Want to be excluded? Submit an opt-out request - no explanation needed, we respect your privacy.
See docs/OPT-IN-OUT.md for our complete privacy policy.
GitHub Discussions (once enabled):
- Share discovery patterns you've found
- Discuss AI-native architecture approaches
- Connect with peer architects
- Learn from others' implementations
This is a discovery project, not a community project (yet). If you've independently adopted discovery patterns in your documentation, we'd love to hear from you.
For now:
- Star the repo if the pattern resonates
- Add
ai-nativetopic to your repos if you're doing this - Share your discoveries
- Open an issue for bugs or feature requests
MIT License - See LICENSE file for details
This project uses discovery patterns in its own documentation. Point your AI at github.com/budgetanalyzer/claude-discovery and see how it works.
Status: Alpha - Building the discovery engine
Author: Human architect + AI collaborator
Date: 2025-01-24
"This is how movements start: not with manifestos, but with people independently discovering truth and finding each other."