v0.1.2 — First published release
Local-first knowledge compiler. Ingest articles, papers, PDFs, YouTube transcripts into a searchable knowledge graph — then wire it into your AI assistant.
Install
npm install -g lumen-kb
lumen init
echo 'ANTHROPIC_API_KEY=sk-ant-...' > ~/.lumen/.env
Then in Claude Code:
lumen install claude
What's in v0.1.2
Ingestion — URL, PDF, YouTube (Innertube captions), arXiv, local files and folders. SHA-256 dedup. Structural chunking (Markdown/HTML/plain text, merge <50t, split >1000t).
Search — 3-signal hybrid: BM25 (FTS5 Porter stemmed) + TF-IDF (in-memory cosine) + Vector ANN (sqlite-vec). Fused via Reciprocal Rank Fusion (k=60). Token budget flag: lumen search "query" -b 4000.
Compilation — LLM extracts concepts + weighted edges with compiled truth + timeline per concept. Parallel: lumen compile -c 5. Model override: --model claude-haiku-4-5-20251001. Prompt caching cuts cost 60-80%.
Enrichment — Concepts auto-escalate: Tier 3 (stub) → Tier 2 (enriched, 3+ mentions) → Tier 1 (full, 6+ mentions). lumen enrich --status.
Graph — PageRank, BFS shortest path, N-hop neighborhood, label propagation communities. lumen graph pagerank, lumen graph path <a> <b>.
Agent wiring — lumen install claude generates 5 files: CLAUDE.md (brain-first protocol), .mcp.json, skill, PreToolUse hook, Stop hook. Agent checks KB before web search, cites [Source: title], captures ideas after every response.
MCP server — 19 tools via stdio: search, brain_ops, compile, capture, add, concept, path, neighbors, god_nodes, communities, pagerank, query, status, profile, session_summary, add_link, backlinks, links, community.
Streaming — lumen ask streams tokens to stdout as they arrive.
Tests — 28 tests: compress pipeline (13), graph engine + PageRank + communities (15), delta stubs (10 todos).
Web UI — Next.js 15 dashboard with search, concept browser, graph visualization.
PRs
- #3 — Signal capture + tiered enrichment (schema v9)
- #4 — Streaming responses + always-on brain wiring
- #5 — Parallel compile, brain-first CLAUDE.md, CLI fixes
- #6 — MCP compile tool, tests, npm publish, MIT license
- #7 — CHANGELOG, CONTRIBUTING, SECURITY, .env.example, knip
What's next (v0.2.0)
- Vector embeddings wired into default search path
- Delta module: smart recompilation of only changed sources
- Browser extension for capturing highlights and URLs