Spec Companion

A desktop app that verifies your code actually implements what your spec says it should. Upload a markdown spec, generate tests from requirements, run them against your codebase, and get alignment reports showing exactly where coverage gaps exist.

Built with Tauri v2 (Rust backend, React frontend). Runs locally, works offline, keeps your code and specs private.

What It Does

Spec in, coverage report out. The full workflow:

Point it at your project -- select your codebase directory, upload a markdown spec
Requirements get extracted automatically -- the parser identifies functional requirements, constraints, and user stories from your spec's structure
Generate tests from requirements -- choose template mode (instant, offline) or LLM mode (Claude API, richer tests) for Jest or PyTest
Execute tests against your codebase -- runs Jest/PyTest with real-time progress, captures stdout/stderr, enforces timeouts
Get an alignment report -- see coverage percentage, mismatch breakdown, and exactly which requirements lack tests, have failing tests, or are only partially covered
Export and share -- JSON, HTML, or CSV reports

Key Strengths

Requirement Extraction That Works

The Markdown parser uses pulldown-cmark to walk the AST, not regex on raw text. It understands heading hierarchy, identifies requirement-bearing sections (Requirements, Features, Acceptance Criteria, User Stories, Constraints), and classifies each requirement by type (functional, non-functional, constraint) and priority. Re-parse anytime the spec changes.

Two Test Generation Modes

Template mode -- instant, offline, zero config. Produces Jest describe/it or PyTest class/def test_ skeletons with Arrange/Act/Assert structure, traceability comments linking back to requirements, and relevant import suggestions based on codebase symbol matching.
LLM mode -- sends requirement context + your codebase's function/class signatures to Claude, gets back tests with meaningful assertions, edge cases, and realistic mock data. Requires an API key (set once in Settings).

Codebase-Aware

Before generating tests, the app scans your project for code symbols (functions, classes, methods) across TypeScript, JavaScript, Python, Rust, Go, Java, Ruby, and C#. These symbols provide context for both template and LLM generation, so generated tests reference your actual code.

Real Test Execution

Not a mock runner. Spawns actual npx jest or python -m pytest processes against your codebase directory. 120-second timeout per test prevents runaways. Stdout/stderr captured in separate threads to avoid pipe deadlocks. Results stored with execution time for trend analysis.

Alignment Analysis

The report engine walks every requirement and checks: does a test exist? Has it been executed? Did it pass? The result is a coverage percentage and a categorized mismatch list:

No Test Generated -- requirement has no test at all
Not Implemented -- test exists but was never run
Test Failing -- all tests for this requirement fail
Partial Coverage -- some tests pass, some fail

Data Stays Local

SQLite database stored in your app data directory. No cloud sync, no telemetry. The only network call is the optional Claude API for LLM test generation, and only when you explicitly trigger it.

Architecture

Frontend (React 19 + TypeScript + Tailwind v4)
    |
    | Tauri IPC (invoke)
    |
Backend (Rust + Tokio async runtime)
    |
    |-- SQLite (rusqlite, 7 tables, WAL mode)
    |-- Spec Parser (pulldown-cmark AST)
    |-- Codebase Scanner (multi-language symbol extraction)
    |-- Template Generator (pattern-matched test skeletons)
    |-- LLM Generator (Claude API via reqwest)
    |-- Test Runner (process spawn with timeout)
    |-- Alignment Engine (requirement-to-result analysis)
    |-- Git Service (libgit2 for branch/commit/diff info)

Frontend state: TanStack Query v5 handles caching, invalidation, and loading states. No Redux.

Database schema: projects > specs > requirements > generated_tests > test_results, plus alignment_reports > alignment_mismatches. All with UUID primary keys, RFC3339 timestamps, and cascading deletes.

Security: All file operations validate paths are within the home directory. File size limits (50MB for spec uploads, 1MB for scanned source files). Parameterized SQL queries. Filename sanitization on uploads. Directory traversal depth limits. Codebase paths canonicalized on storage.

Supported Frameworks

Test Framework	Command	Language
Jest	`npx jest`	JavaScript / TypeScript
PyTest	`python -m pytest`	Python

Codebase scanning supports: TypeScript, JavaScript, Python, Rust, Go, Java, Ruby, C#.

Spec format: Markdown (.md, .txt, .markdown).

Getting Started

Prerequisites

Node.js (LTS)
pnpm
Rust (stable)
Tauri v2 CLI: cargo install tauri-cli --version "^2"

Development

pnpm install
pnpm tauri dev

Lean Development (Low Disk Mode)

Use lean mode when you want to minimize repo-local disk growth while iterating.

pnpm install
pnpm run dev:lean

What lean mode does:

Starts the same app entrypoint (pnpm tauri dev)
Redirects heavy build caches to temporary directories
Cleans temporary caches and heavy build artifacts when the app exits

Tradeoff:

Uses less persistent disk in the project directory
Can be slower to restart than normal dev because caches are ephemeral

Build

pnpm tauri build

Produces .app + .dmg on macOS, .msi on Windows, .deb/.AppImage on Linux.

Cleanup Commands

Targeted cleanup (heavy build artifacts only):

pnpm run clean:heavy

Full local cleanup (all reproducible local caches/artifacts, including node_modules):

pnpm run clean:full

Configuration

Open Settings in the app to configure:

Claude API Key -- required only for LLM test generation mode
Default Framework -- Jest or PyTest
Default Generation Mode -- Template or LLM
Scan Exclusion Patterns -- directories to skip during codebase scanning (e.g., dist, build, .cache)

Export Formats

Format	Contents
JSON	Full report object with all mismatches and metadata
HTML	Styled table with coverage stats, color-coded mismatch badges
CSV	Tabular data: requirement ID, section, mismatch type, details

Tech Stack

Layer	Technology
Desktop framework	Tauri v2
Backend	Rust (2021 edition)
Async runtime	Tokio
Database	SQLite via rusqlite
HTTP client	reqwest
Git	libgit2 via git2
Markdown parsing	pulldown-cmark
Frontend	React 19 + TypeScript (strict)
Styling	Tailwind CSS v4
State management	TanStack Query v5
Charts	Recharts
Code highlighting	prism-react-renderer
Markdown rendering	react-markdown

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.codex		.codex
.github/workflows		.github/workflows
docs/adr		docs/adr
openapi		openapi
public		public
scripts		scripts
src-tauri		src-tauri
src		src
tests/ui		tests/ui
.gitignore		.gitignore
.lighthouserc.json		.lighthouserc.json
.stylelintrc.cjs		.stylelintrc.cjs
AGENTS.md		AGENTS.md
README.md		README.md
eslint.config.mjs		eslint.config.mjs
index.html		index.html
package.json		package.json
playwright.config.ts		playwright.config.ts
pnpm-lock.yaml		pnpm-lock.yaml
prettier.config.cjs		prettier.config.cjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spec Companion

What It Does

Key Strengths

Requirement Extraction That Works

Two Test Generation Modes

Codebase-Aware

Real Test Execution

Alignment Analysis

Data Stays Local

Architecture

Supported Frameworks

Getting Started

Prerequisites

Development

Lean Development (Low Disk Mode)

Build

Cleanup Commands

Configuration

Export Formats

Tech Stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Spec Companion

What It Does

Key Strengths

Requirement Extraction That Works

Two Test Generation Modes

Codebase-Aware

Real Test Execution

Alignment Analysis

Data Stays Local

Architecture

Supported Frameworks

Getting Started

Prerequisites

Development

Lean Development (Low Disk Mode)

Build

Cleanup Commands

Configuration

Export Formats

Tech Stack

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages