Codebase Intelligence Platform

💚 Free & open source, forever. Every feature is available to everyone — no paywalls, no tiers, no sign-up. Clone and self-host it, or use the hosted app. Licensed under Apache-2.0. If it helps you, sponsoring is welcome but always optional.

▶ Try it / deploy your own: · see DEPLOY.md for CLI & self-hosting.

🖥️ CLI: index any repo and ask questions with citations (zero-network; ANTHROPIC_API_KEY optional) — published on npm (needs Node ≥18):

npm i -g @mnikks01/codeintel    # installs the `codeintel` command — or use npx (no install) below
npx @mnikks01/codeintel search ./src "where is auth handled?"
npx @mnikks01/codeintel ask ./ "how does retrieval fuse vector and lexical hits?"
npx @mnikks01/codeintel related ./ src/index.ts

From a clone instead: node engine/src/cli.ts <args>.

Deep, queryable understanding of any codebase. Ask questions, get grounded answers with citations; auto-generated living docs; impact/blast-radius analysis; onboarding acceleration — exposed as an API and an MCP server so any tool or agent can use it.

Project #2 (Core engine) · Priority ⭐⭐⭐ · Difficulty: High · Time-to-MVP: 2–3 months

About this repository

This is Codebase Intelligence (project #2), extracted from a larger "AI Startup Lab." The docs at the repo root are the product spec/vision; references to sibling pieces (ContextOS, DECISION_LOG, the *_GUIDE files) point to that broader context and aren't part of this standalone repo.

The working engine is in engine/ — a real, evaluated RAG-over-code pipeline (ingest → chunk → embed → hybrid retrieve → re-rank → import-graph → cite/answer), pure TypeScript, zero-network by default (real embedding/LLM APIs swap in for production). Try it:

cd engine
node scripts/demo.ts        # index this engine's own source + ask it questions
node scripts/eval.ts        # retrieval quality (recall@k, MRR) — hybrid vs +re-ranker

Current eval (local-embeddings baseline, on the engine's own source): recall@3 = 100%, MRR 0.818 → 0.864 with the re-ranker. See engine/README.md.

Licensed under Apache-2.0 — see LICENSE.

What we're building

A platform that ingests a codebase and builds a rich, retrievable understanding of it: semantic + lexical search, grounded Q&A ("how does auth work here?"), automatically maintained documentation and architecture maps, and impact analysis ("what breaks if I change this function?"). It's the RAG-over-code engine — the hardest, most defensible piece of the whole lab, reused by ContextOS (#1) and System Design Assistant (#6).

Why we're building it

"Talk to your codebase" is validated, hot demand (2026) but most tools are shallow — naive chunking, weak cross-file reasoning, no governance.
It's the technical center of gravity: master RAG-over-code once, power three products.
Onboarding, code understanding, and impact analysis are universal, expensive pains.

Who it's for

Engineering teams with large/legacy/polyrepo codebases; new hires; agencies onboarding to client code; any AI agent needing to understand code. See CUSTOMERS.md.

How it works

flowchart LR
    REPO[Repo - GitHub App] --> ING[Ingest + AST chunk]
    ING --> EMB[Embed - code + NL summary]
    EMB --> VEC[(pgvector + full-text)]
    GRAPH[Symbol/dependency graph] --> STORE[(Postgres)]
    ING --> GRAPH
    Q[Question] --> RET[Hybrid retrieve + graph expand + rerank]
    VEC --> RET
    STORE --> RET
    RET --> LLM[Grounded answer + citations]
    EMB --> DOCS[Auto-docs + architecture map]

Core capabilities (maturity ladder)

Stage	Capability
MVP	Connect repo, index, semantic+keyword code search, grounded Q&A with file:line citations, auto repo summary
V1	Teams/RBAC, living auto-docs, architecture diagrams, multi-repo, search/reporting, impact analysis
V2	Agents (codebase Q&A agent, PR review), MCP server, automation, enterprise controls
V3	On-prem/VPC, SSO, advanced governance, org-wide knowledge graph

Full inventory: FEATURES.md.

Document map

VISION · PROBLEM · CUSTOMERS · FEATURES · USER_STORIES · ARCHITECTURE · TECH_STACK · DATABASE · API_DESIGN · AI_ARCHITECTURE · RAG · MCP · AGENT_DESIGN · SECURITY · OBSERVABILITY · GUARDRAILS · DEVOPS · TASKS · SPRINTS · PRICING · GTM · SALES · RISKS · HIRING · OPEN_SOURCE · RESUME_VALUE · CLAUDE.md · AGENTS.md · llms.txt · mcp.json

One-liner

Codebase Intelligence is the brain that actually understands your code — searchable, explainable, and queryable by humans and agents alike.

Build #2 in the wedge (after #3, before #1). It is the moat. See START_HERE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.claude		.claude
.github		.github
.husky		.husky
engine		engine
mcp-server		mcp-server
web		web
.editorconfig		.editorconfig
.gitignore		.gitignore
.nvmrc		.nvmrc
AGENTS.md		AGENTS.md
AGENT_DESIGN.md		AGENT_DESIGN.md
AI_ARCHITECTURE.md		AI_ARCHITECTURE.md
API.md		API.md
API_DESIGN.md		API_DESIGN.md
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
CUSTOMERS.md		CUSTOMERS.md
DATABASE.md		DATABASE.md
DEPLOY.md		DEPLOY.md
DEVOPS.md		DEVOPS.md
FEATURES.md		FEATURES.md
GO_LIVE.md		GO_LIVE.md
GTM.md		GTM.md
GUARDRAILS.md		GUARDRAILS.md
HIRING.md		HIRING.md
LICENSE		LICENSE
MCP.md		MCP.md
OBSERVABILITY.md		OBSERVABILITY.md
OPEN_SOURCE.md		OPEN_SOURCE.md
PRICING.md		PRICING.md
PROBLEM.md		PROBLEM.md
RAG.md		RAG.md
README.md		README.md
RESUME_VALUE.md		RESUME_VALUE.md
RISKS.md		RISKS.md
SALES.md		SALES.md
SECURITY.md		SECURITY.md
SPRINTS.md		SPRINTS.md
TASKS.md		TASKS.md
TECH_STACK.md		TECH_STACK.md
USER_STORIES.md		USER_STORIES.md
VISION.md		VISION.md
commitlint.config.js		commitlint.config.js
llms.txt		llms.txt
mcp.json		mcp.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Codebase Intelligence Platform

About this repository

What we're building

Why we're building it

Who it's for

How it works

Core capabilities (maturity ladder)

Document map

One-liner

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Codebase Intelligence Platform

About this repository

What we're building

Why we're building it

Who it's for

How it works

Core capabilities (maturity ladder)

Document map

One-liner

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages