GitHub Flashlight

A multi-agent pipeline built on the Claude Agent SDK that performs dependency-aware codebase analysis and visualization.

Features

Automatic Service Discovery: Identifies services in Rust (Cargo.toml), Go (go.mod), Node.js (package.json), and Python (pyproject.toml) codebases
Dependency Graph Analysis: Builds and visualizes service dependency relationships
Two-Phase Analysis:
- Phase 1: Analyzes foundation services with no dependencies
- Phase 2: Analyzes remaining services in dependency order with upstream context
Context-Aware: Code analyzers receive analyses of direct dependencies to understand integration patterns
Comprehensive Documentation: Generates system-wide architecture documentation with patterns, flows, and recommendations
Multi-Agent Orchestration: Uses specialized agents for discovery, analysis, and documentation synthesis

Component Classification

GitHub Flashlight uses a deterministic discovery engine (zero LLM calls) to classify every component into one of eight ComponentKind values via language-specific plugins:

| Kind | Description |
| --- | --- |
| Library | Reusable code with no entrypoint |
| Service | Long-running process (HTTP, gRPC, daemon) |
| CLI | Command-line tool |
| Contract | Smart contract, API definition, or schema |
| Infra | Infrastructure-as-code or deployment config (Terraform, Helm, K8s) |
| Pipeline | Data pipeline or workflow definition (Airflow, dbt) |
| Frontend | UI application (React, Vue, Streamlit, SwiftUI) |
| Unknown | Could not classify deterministically |
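
For readers who want to consume these kinds in their own tooling, the eight values could be modeled as a Python enum. This is a minimal sketch, not the project's actual definition: the lowercase string values and the `parse_kind` helper are assumptions made for illustration.

```python
from enum import Enum


class ComponentKind(str, Enum):
    """The eight kinds listed above; the string values are illustrative."""

    LIBRARY = "library"
    SERVICE = "service"
    CLI = "cli"
    CONTRACT = "contract"
    INFRA = "infra"
    PIPELINE = "pipeline"
    FRONTEND = "frontend"
    UNKNOWN = "unknown"


def parse_kind(raw: str) -> ComponentKind:
    """Map a free-form label to a kind, falling back to UNKNOWN (hypothetical helper)."""
    try:
        return ComponentKind(raw.strip().lower())
    except ValueError:
        return ComponentKind.UNKNOWN
```

Falling back to `UNKNOWN` rather than raising mirrors the "could not classify deterministically" kind.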

Supported Languages

Go (`go.mod`)

Service: main.go or package main with server indicators (listenandserve, grpc.newserver, net.listen, etc.) or service-like names (server, daemon, proxy, worker)
CLI: main.go with CLI indicators (cobra.command, pflag, os.args) or CLI-like names (cli, tool)
Library: No main.go, no package main, or has a cmd/ directory (executables inside cmd/ become their own components)
Supports single-module monorepos with per-package discovery and Go import tracing
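
The content-scanning part of the Go rules above can be sketched roughly as follows. The indicator strings come from the lists above; the function name, the case-insensitive substring matching, and returning `None` when nothing matches are assumptions. The real plugin also applies name heuristics and the `cmd/` rule, which this sketch omits.

```python
from typing import Optional

# Indicator substrings taken from the rules above, compared in lowercase.
SERVER_INDICATORS = ("listenandserve", "grpc.newserver", "net.listen")
CLI_INDICATORS = ("cobra.command", "pflag", "os.args")


def classify_go_source(source: str) -> Optional[str]:
    """Classify Go source text; None means 'undecided, try other heuristics'."""
    text = source.lower()
    if "package main" not in text:
        return "library"
    if any(marker in text for marker in SERVER_INDICATORS):
        return "service"
    if any(marker in text for marker in CLI_INDICATORS):
        return "cli"
    return None
```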

Rust (`Cargo.toml`)

Service: [[bin]] or src/main.rs with server framework deps (actix-web, axum, warp, rocket, tonic, hyper) or service-like names
CLI: Executable with CLI framework deps (clap, structopt, argh) or CLI-like names
Library: [lib] section only, or hybrid crates with both lib.rs and main.rs
Supports Cargo workspaces with glob member patterns

Python (`pyproject.toml`)

Pipeline: Markers for airflow, dagster, prefect, dbt, luigi
Frontend: Markers for streamlit, gradio, panel, dash
Service: Web framework deps (fastapi, flask, django, starlette, etc.) with [project.scripts] or __main__.py
CLI: Has [project.scripts] or __main__.py without web framework deps
Library: No entry points or framework markers
Supports both PEP 621 and Poetry dependency formats
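
One way to read the Python rules above is as a priority list. The sketch below assumes the rules are checked in the order listed (Pipeline, Frontend, Service, CLI, Library) and that dependency names match the markers exactly. Both are simplifications, and `classify_python` is a hypothetical name, not the plugin's API.

```python
PIPELINE_MARKERS = {"airflow", "dagster", "prefect", "dbt", "luigi"}
FRONTEND_MARKERS = {"streamlit", "gradio", "panel", "dash"}
WEB_FRAMEWORKS = {"fastapi", "flask", "django", "starlette"}


def classify_python(dependencies: set[str], has_entrypoint: bool) -> str:
    """has_entrypoint: True if [project.scripts] or __main__.py is present."""
    deps = {d.lower() for d in dependencies}
    if deps & PIPELINE_MARKERS:
        return "pipeline"
    if deps & FRONTEND_MARKERS:
        return "frontend"
    if has_entrypoint and deps & WEB_FRAMEWORKS:
        return "service"
    if has_entrypoint:
        return "cli"
    return "library"
```

Under this reading, a package that depends on both `streamlit` and `fastapi` is classified as a Frontend because the frontend check runs first.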

TypeScript / JavaScript (`package.json`)

CLI: "bin" field present
Frontend: Frontend framework deps (react, vue, svelte, angular, next, nuxt, remix, solid-js, etc.)
Service: Server framework deps (express, fastify, koa, nestjs, hono) with a "start" script or "main" field
Library: No binary, framework, or server indicators
Supports npm/yarn/pnpm workspaces with recursive member discovery
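
The `package.json` rules above might be approximated like this. The sketch assumes the checks run in the order listed and that only the `dependencies` field is inspected; the function name is hypothetical and workspace discovery is left out.

```python
import json

FRONTEND_DEPS = {"react", "vue", "svelte", "angular", "next", "nuxt", "remix", "solid-js"}
SERVER_DEPS = {"express", "fastify", "koa", "nestjs", "hono"}


def classify_package_json(text: str) -> str:
    """Classify a package.json document given as a JSON string."""
    manifest = json.loads(text)
    deps = set(manifest.get("dependencies", {}))
    if "bin" in manifest:
        return "cli"
    if deps & FRONTEND_DEPS:
        return "frontend"
    has_start = "start" in manifest.get("scripts", {})
    if deps & SERVER_DEPS and (has_start or "main" in manifest):
        return "service"
    return "library"
```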

Solidity (`foundry.toml`, `hardhat.config.ts`/`.js`)

Contract: Contains contract, abstract contract, or interface declarations
Library: Every declaration uses the Solidity library keyword
Supports both Foundry and Hardhat projects with import remapping and multi-package discovery
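
The Contract/Library distinction above depends on which declaration keywords appear in a source file. A rough sketch using a regular expression follows; the pattern, the function name, and the "unknown" result for files with no declarations are assumptions, and a real parser would handle comments and strings more carefully.

```python
import re

# Matches declarations such as "contract Vault" or "library SafeMath" at line start.
DECLARATION = re.compile(
    r"^\s*(abstract\s+contract|contract|interface|library)\s+\w+", re.MULTILINE
)


def classify_solidity(source: str) -> str:
    """Return 'library' only if every declaration uses the library keyword."""
    kinds = [m.group(1).split()[-1] for m in DECLARATION.finditer(source)]
    if not kinds:
        return "unknown"
    if all(kind == "library" for kind in kinds):
        return "library"
    return "contract"
```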

Swift (`Package.swift`)

Service: .executableTarget with server framework indicators (Vapor, Hummingbird, SwiftNIO, GRPC) or service-like names
Frontend: Executable with iOS/macOS indicators (UIApplication, SwiftUI, WindowGroup)
CLI: .executableTarget with argument parsing (ParsableCommand, ArgumentParser) or CLI-like names; default for unclassified executables
Library: .target without main.swift or @main, .binaryTarget, .systemLibrary
Supports SPM multi-target packages with per-target discovery

Detection Pipeline

Manifest discovery: Scans for language-specific manifest files (Cargo.toml, go.mod, package.json, pyproject.toml, foundry.toml, Package.swift)
Manifest analysis: Checks for binary/entrypoint indicators in manifest structure
File structure: Looks for main.rs, main.go, __main__.py, main.swift, @main attribute
Dependency scanning: Identifies framework-specific dependencies (e.g., axum -> Service, clap -> CLI, react -> Frontend)
Content scanning: Reads source files for server/CLI/UI indicators (Go reads all .go files; Swift reads up to 10 .swift files)
Name-based heuristics: Keywords like "server", "api", "daemon" -> Service; "cli", "tool" -> CLI
Default: Falls back to Library (or Service/CLI for executables, depending on the language)
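
The steps above behave like a cascade: each detector either returns a kind or passes to the next one, and a default applies when none matches. The sketch below shows only the dependency and name steps. The component dictionary shape, the function names, and the token-based name matching are assumptions for illustration.

```python
from typing import Callable, Optional

Detector = Callable[[dict], Optional[str]]

# Example dependency hints taken from the list above.
DEPENDENCY_HINTS = {"axum": "service", "clap": "cli", "react": "frontend"}


def by_dependency(component: dict) -> Optional[str]:
    for dep in component.get("dependencies", []):
        if dep in DEPENDENCY_HINTS:
            return DEPENDENCY_HINTS[dep]
    return None


def by_name(component: dict) -> Optional[str]:
    # Split "billing_api" or "billing-api" into tokens before matching keywords.
    tokens = component.get("name", "").lower().replace("_", "-").split("-")
    if any(t in ("server", "api", "daemon") for t in tokens):
        return "service"
    if any(t in ("cli", "tool") for t in tokens):
        return "cli"
    return None


def classify(component: dict, detectors: list[Detector], default: str = "library") -> str:
    """Return the first detector result that is not None, else the default."""
    for detect in detectors:
        kind = detect(component)
        if kind is not None:
            return kind
    return default
```

Ordering the detectors from strongest to weakest evidence means a `clap` dependency wins over a name such as `billing-api`.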

Architecture

The pipeline uses four specialized roles:

Primary Leader (orchestrator)
- Discovers services by scanning for manifest files
- Builds dependency graph and determines analysis order
- Spawns code analyzer agents with appropriate context
- Spawns external service analyzers for runtime integrations
- Spawns architecture documenter for final synthesis
Code Analyzer (multiple instances)
- Deep analysis of individual services
- Examines architecture, components, data flows, dependencies, API surface
- Documents all third-party dependencies with version, category, and purpose
- Receives context from direct dependencies
- Outputs Markdown reports
External Service Analyzer (per-service instances)
- Deep-dives into how external services (databases, cloud platforms, APIs) are integrated
- Documents client libraries, authentication, API surface, and configuration
- Produces integration analysis files for architecture synthesis
Architecture Documenter (single instance)
- Synthesizes all service analyses
- Aggregates external dependencies into a complete technology inventory
- Identifies system-wide patterns
- Creates comprehensive architecture documentation

Installation

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -e .

# Set up API key
cp .env.example .env
# Edit .env and add your ANTHROPIC_API_KEY

Usage

# Run the pipeline
python -m github_flashlight.agent

# Or use the installed command
github-flashlight

Then provide a path to analyze:

You: Analyze the codebase at /path/to/repo

Verbose Logging

Enable detailed SDK and API interaction logging:

# Verbose mode - Shows API calls, agent spawning, and tool usage
AGENT_VERBOSE=true python -m github_flashlight.agent

# Debug mode - Full trace logging including API request/response details
AGENT_DEBUG=true python -m github_flashlight.agent

When enabled, you'll see real-time information about:

📤 API requests to Claude
📥 API responses
🚀 Subagent spawning and lifecycle
🔧 Tool calls with parameters
✅ Tool results and success/failure status
📝 Agent context and model information

This is useful for:

Understanding what the agents are doing in real time
Debugging analysis pipeline issues
Monitoring API usage and performance
Learning how the multi-agent system orchestrates tasks
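
If you wrap the pipeline in your own scripts, the two flags can be mapped to standard `logging` levels along these lines. This is only a sketch: the mapping to `INFO` and `DEBUG`, the precedence of `AGENT_DEBUG`, and treating only the value `true` as enabled are assumptions, not the project's documented behavior.

```python
import logging


def resolve_log_level(env: dict[str, str]) -> int:
    """Pick a logging level from AGENT_DEBUG / AGENT_VERBOSE (assumed mapping)."""

    def enabled(name: str) -> bool:
        return env.get(name, "").strip().lower() == "true"

    if enabled("AGENT_DEBUG"):
        return logging.DEBUG
    if enabled("AGENT_VERBOSE"):
        return logging.INFO
    return logging.WARNING
```

Calling `logging.basicConfig(level=resolve_log_level(dict(os.environ)))` (after `import os`) would apply the result to the root logger.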

Live Observability Monitor

For real-time visual monitoring of agent execution with interactive profiling:

./observability/live_monitor.sh

This launches a web-based profiler that automatically tracks your current session. The dashboard updates live as agents work, showing tool calls, timing metrics, and agent interactions for monitoring and performance analysis.

Once given a repository path, the pipeline will:

Scan for services (Cargo.toml, go.mod, package.json, pyproject.toml files)
Build dependency graph
Analyze services in two phases:
- Phase 1: Services with no dependencies (parallel)
- Phase 2: Services with dependencies (in order, with context)
Generate architecture documentation

Output Structure

files/
├── service_discovery/
│   ├── services.json              # Discovered services metadata
│   └── discovery_log.md           # Human-readable discovery log
├── dependency_graphs/
│   ├── dependency_graph.json      # Machine-readable graph
│   └── dependency_graph.md        # Visualization
├── service_analyses/
│   ├── {service1}.json            # Structured analysis
│   ├── {service1}.md              # Human-readable report
│   └── ... (one pair per service)
└── architecture_docs/
    ├── architecture.md            # Comprehensive documentation
    └── quick_reference.md         # One-page summary

logs/
└── session_YYYYMMDD_HHMMSS/
    ├── transcript.txt             # Conversation log
    └── tool_calls.jsonl           # Structured tool usage

Example Analysis Flow

For a Rust codebase with this structure:

repo/
├── common-utils/          (no dependencies)
├── config-loader/         (no dependencies)
├── database-layer/        (depends on common-utils)
├── auth-service/          (depends on database-layer)
└── api-gateway/           (depends on auth-service, database-layer)

The agent will:

Phase 1: Analyze common-utils and config-loader in parallel
Phase 2:
- Analyze database-layer with context from common-utils
- Analyze auth-service with context from database-layer only (not common-utils)
- Analyze api-gateway with context from auth-service and database-layer
Synthesis: Generate comprehensive architecture documentation
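
The example above can be reproduced with a few lines of Python. The dictionary encodes the repository structure shown earlier; the variable and function names are illustrative only.

```python
# Direct dependencies for each service in the example repository.
DEPENDS_ON = {
    "common-utils": [],
    "config-loader": [],
    "database-layer": ["common-utils"],
    "auth-service": ["database-layer"],
    "api-gateway": ["auth-service", "database-layer"],
}

# Phase 1: services with no dependencies.
phase_one = sorted(name for name, deps in DEPENDS_ON.items() if not deps)

# Phase 2 context: each remaining service sees only its direct dependencies.
context_for = {name: deps for name, deps in DEPENDS_ON.items() if deps}


def transitive(name: str) -> set[str]:
    """All dependencies reachable from name, shown here only for contrast."""
    seen: set[str] = set()
    stack = list(DEPENDS_ON[name])
    while stack:
        dep = stack.pop()
        if dep not in seen:
            seen.add(dep)
            stack.extend(DEPENDS_ON[dep])
    return seen


print(phase_one)                    # ['common-utils', 'config-loader']
print(context_for["auth-service"])  # ['database-layer']
```

`transitive("auth-service")` also contains `common-utils`, which is exactly the context the analyzer for `auth-service` does not receive.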

Key Design Principles

Direct Dependencies Only: Analyzers receive context only from direct dependencies, not transitive ones
Dependency Order: Services are analyzed in topological order to ensure dependencies are analyzed first
Parallel Execution: Services at the same dependency level are analyzed in parallel
Structured Output: Both machine-readable (JSON) and human-readable (Markdown) outputs

Supported Languages

Rust: Full support (Cargo.toml discovery, dependency extraction)
Go: Full support (go.mod discovery, dependency extraction)
Node.js: Partial support (package.json discovery)
Python: Partial support (pyproject.toml discovery)

Requirements

Python 3.10+
Claude API key
Access to the codebase to analyze

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests (when available)
pytest

How It Works

The primary leader orchestrates a four-phase workflow:

Discovery Phase

Uses Glob to find manifest files (Cargo.toml, go.mod, package.json, pyproject.toml)
Reads each manifest to extract service metadata
Identifies internal dependencies (path-based in manifests)
Saves service inventory to JSON
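
In the pipeline the leader agent performs this scan with its Glob tool. An equivalent standalone scan in plain Python might look like the sketch below. The `SKIP_DIRS` set and the function name are assumptions; the source does not say which directories, if any, are excluded.

```python
from pathlib import Path

MANIFESTS = ("Cargo.toml", "go.mod", "package.json", "pyproject.toml")
# Assumed exclusions: vendored or generated directories that would add noise.
SKIP_DIRS = {"node_modules", "target", ".git", "venv"}


def find_manifests(root: Path) -> list[Path]:
    """Return all manifest files under root, skipping excluded directories."""
    found = []
    for name in MANIFESTS:
        for path in root.rglob(name):
            if not SKIP_DIRS.intersection(path.relative_to(root).parts):
                found.append(path)
    return sorted(found)
```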

Graph Building Phase

Constructs directed dependency graph
Calculates analysis order using two-phase approach:
- Phase 1: Services with in-degree 0 (no dependencies)
- Phase 2: Topological sort of remaining services
Visualizes graph in both JSON and Markdown
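
The two-phase ordering described above can be sketched with the standard library's `graphlib`. This is an illustration under the assumption that the graph is stored as a mapping from each service to its direct dependencies; `analysis_order` is a hypothetical name, and the project may compute the order differently.

```python
from graphlib import TopologicalSorter


def analysis_order(depends_on: dict[str, list[str]]) -> tuple[list[str], list[str]]:
    """Split services into Phase 1 (no dependencies) and an ordered Phase 2."""
    phase_one = sorted(name for name, deps in depends_on.items() if not deps)
    # static_order() yields every node after all of its dependencies and
    # raises graphlib.CycleError if the graph contains a cycle.
    ordered = list(TopologicalSorter(depends_on).static_order())
    phase_two = [name for name in ordered if name not in phase_one]
    return phase_one, phase_two
```

Because `CycleError` is raised for circular dependencies, a caller can report the cycle instead of silently producing an invalid order.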

Analysis Phase

Phase 1: Spawns code-analyzer for each no-dependency service (parallel)
Phase 2: For each remaining service:
- Waits for its direct dependencies to complete
- Loads direct dependency analyses
- Builds context summary (architecture, APIs, components)
- Spawns code-analyzer with context
- Ensures proper ordering while maximizing parallelism
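
The "wait for direct dependencies, then run" behavior can be illustrated with `asyncio`. The sketch below is not the SDK's subagent mechanism: `run_pipeline` and the `analyze` callback are hypothetical stand-ins for spawning a code-analyzer, and the graph is assumed to be acyclic.

```python
import asyncio


async def run_pipeline(depends_on, analyze):
    """Run analyze(name, context) for every service, respecting dependencies."""
    done = {name: asyncio.Event() for name in depends_on}
    results = {}

    async def run_one(name):
        # Block until every direct dependency has finished.
        for dep in depends_on[name]:
            await done[dep].wait()
        # Context contains direct dependencies only.
        context = {dep: results[dep] for dep in depends_on[name]}
        results[name] = await analyze(name, context)
        done[name].set()

    # All services start at once; each proceeds as soon as it is unblocked.
    await asyncio.gather(*(run_one(name) for name in depends_on))
    return results
```

Services with no dependencies start immediately, so Phase 1 falls out of the same loop, and independent Phase 2 services run concurrently once their own dependencies finish.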

Synthesis Phase

Spawns architecture-documenter after all analyses complete
Reads all service analyses and dependency graph
Identifies system-wide patterns and architectural approaches
Generates comprehensive documentation with:
- System overview
- Service catalog
- Dependency visualization
- Architectural patterns
- Technology stack
- Major data flows
- Development guide
- Recommendations

Contributing

This project showcases the Claude Agent SDK's multi-agent composition capabilities. Feel free to extend it with:

Additional language support (Java, C#, etc.)
Enhanced metrics collection (LOC, complexity, test coverage)
Incremental analysis for large repositories
Custom analysis plugins
Additional visualization options

License

See parent repository for license information.
