VectCode

A semantic code search tool that indexes code repositories into a vector database, enabling natural language search across your codebase.

Features

Multi-repository indexing: Index multiple Go projects into a unified knowledge base
Semantic search: Query your codebase using natural language via vector embeddings
ChromaDB integration: Fast vector storage and retrieval
MCP Server: Use VectCode with Claude Desktop and other LLM clients via Model Context Protocol
Ollama embeddings: Free, local embeddings with BGE-M3 model (or OpenAI alternative)

Installation

Prerequisites

ChromaDB - Vector database server:

docker run -d -p 8000:8000 chromadb/chroma

Ollama (recommended) - Local embedding model:

# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull the BGE-M3 embedding model
ollama pull bge-m3

Build from Source

# Build CLI tool
go build -o vectcode ./cmd/vectcode

# Build MCP server (optional, for Claude Desktop integration)
go build -o vectcode-mcp-server ./cmd/mcp-server

Quick Start

1. Setup Configuration

mkdir -p ~/.vectcode
cp config.example.yaml ~/.vectcode/config.yaml

Edit ~/.vectcode/config.yaml to configure ChromaDB and Ollama endpoints.

2. Index a Project

./vectcode index --path ~/projects/my-service --name my-service

Re-indexing with clean slate:

# Use --clean to delete existing data first (removes orphaned chunks from deleted code)
./vectcode index --path ~/projects/my-service --name my-service --clean

3. Query the Codebase

./vectcode query --query "where is the user authentication handler?" --limit 5

4. List Indexed Projects

./vectcode list

5. Delete a Project

./vectcode delete --name my-service

MCP Server (Claude Desktop Integration)

VectCode can be used as an MCP (Model Context Protocol) server, allowing Claude Desktop and other LLM clients to search your indexed codebases during conversations.

See MCP_SETUP.md for detailed setup instructions.

Quick setup for Claude Desktop (macOS):

Build the MCP server:

go build -o vectcode-mcp-server ./cmd/mcp-server
sudo cp vectcode-mcp-server /usr/local/bin/

Edit ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "vectcode": {
      "command": "/usr/local/bin/vectcode-mcp-server"
    }
  }
}

Restart Claude Desktop and start searching your code!

CLI Options

All commands support a --config flag to specify a custom config file:

./vectcode --config /path/to/config.yaml index --path . --name myproject

Configuration

VectCode uses a configuration file at ~/.vectcode/config.yaml:

vector_store:
  type: chroma
  collection: vectcode
  options:
    endpoint: http://localhost:8000

embeddings:
  # Option 1: Ollama (local, free, recommended)
  provider: ollama
  model: bge-m3
  endpoint: http://localhost:11434

  # Option 2: OpenAI (requires API key)
  # provider: openai
  # model: text-embedding-3-small
  # api_key_env: OPENAI_API_KEY

Architecture

vectcode/
├── cmd/
│   ├── vectcode/      # CLI entry point
│   └── mcp-server/     # MCP server for LLM integration
├── pkg/
│   ├── parser/         # Code parsing (AST analysis)
│   ├── chunker/        # Code chunking logic
│   ├── embedder/       # Generate embeddings (Ollama/OpenAI)
│   ├── vectorstore/    # Vector store interface and ChromaDB implementation
│   ├── indexer/        # Orchestrates parsing and storing
│   ├── query/          # Query engine for semantic search
│   ├── config/         # Configuration management
│   └── mcp/            # MCP protocol and server implementation

How It Works

Parsing: VectCode parses Go source files using AST analysis to extract:
- Functions and methods
- Struct and interface definitions
- Constants and global variables
- Documentation strings
Chunking: Each code element (function, struct, etc.) is extracted as a separate chunk with:
- Code content
- File path and line numbers
- Documentation
- Type information
Embedding: Code chunks are converted to vector embeddings using:
- Ollama with BGE-M3 model (local, free)
- Or OpenAI's text-embedding models
Storage: Embeddings and metadata are stored in ChromaDB for fast similarity search
Querying: Natural language queries are embedded and matched against stored code chunks using cosine similarity

Supported Languages

Currently supported:

Go: Full AST-based parsing with function, method, struct, and interface extraction

Use Cases

Code Discovery: Find relevant code examples across multiple repositories
Onboarding: Help new team members understand codebase structure
Refactoring: Find all usages and similar patterns
Documentation: Locate functions and their documentation
LLM Integration: Use with Claude Desktop for AI-powered code assistance

Re-indexing Behavior

When you re-index a project, VectCode uses deterministic IDs (based on project:file:name) to handle updates:

Without --clean flag:

Existing code chunks are updated (upsert behavior)
New code chunks are added
⚠️ Orphaned chunks remain: If you delete code from your project, those chunks stay in the database

With --clean flag:

All existing project data is deleted first
Then indexes from scratch
✅ No orphaned chunks: Ensures database exactly matches current code state

When to use --clean:

After deleting or renaming files/functions
When you want to ensure a fresh, accurate index
Troubleshooting stale search results

Roadmap

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.claude		.claude
cmd		cmd
examples		examples
pkg		pkg
.gitignore		.gitignore
GETTING_STARTED.md		GETTING_STARTED.md
MCP_SETUP.md		MCP_SETUP.md
Makefile		Makefile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
STRUCTURE.txt		STRUCTURE.txt
TESTING.md		TESTING.md
config.example.yaml		config.example.yaml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VectCode

Features

Installation

Prerequisites

Build from Source

Quick Start

1. Setup Configuration

2. Index a Project

3. Query the Codebase

4. List Indexed Projects

5. Delete a Project

MCP Server (Claude Desktop Integration)

CLI Options

Configuration

Architecture

How It Works

Supported Languages

Use Cases

Re-indexing Behavior

Roadmap

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

jayjzheng/vectcode

Folders and files

Latest commit

History

Repository files navigation

VectCode

Features

Installation

Prerequisites

Build from Source

Quick Start

1. Setup Configuration

2. Index a Project

3. Query the Codebase

4. List Indexed Projects

5. Delete a Project

MCP Server (Claude Desktop Integration)

CLI Options

Configuration

Architecture

How It Works

Supported Languages

Use Cases

Re-indexing Behavior

Roadmap

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages