
WikiBricks

A wiki for your AI agent, on Databricks. Delta + Vector Search + Unity Catalog, exposed as native MCP tools. Your agent gets a persistent, versioned, typed-link knowledge store that grows from its own sessions — and you can search and read those sessions back from your next conversation.

Two pieces, both optional, designed to be used together:

  • The wiki store — a Databricks Asset Bundle that deploys the schema, tables, Vector Search index, UC functions, and a daily maintenance job. One databricks bundle deploy, then any agent that speaks MCP can read and write it.
  • The recorder — an optional Claude Code plugin that records every session as one wiki page and exposes 5 MCP tools to the agent so it can search prior sessions. This is the easy on-ramp: 5 minutes to install, immediately useful.

Grounding idea: Andrej Karpathy's LLM Wiki pattern — instead of re-retrieving raw documents at every query, the agent incrementally compiles a structured, interlinked wiki it maintains itself. Knowledge compounds instead of getting re-derived. See also Context and Memory for Agents on Databricks for the design rationale.

5-minute install: the personal recorder

The fastest way to see what WikiBricks does is to install the recorder plugin and let it record one Claude Code session.

1. Deploy the wiki store (one-time, ~3 minutes)

git clone https://github.com/philtief/wikibricks.git
cd wikibricks
cp databricks.override.example.yml databricks.override.yml
# edit: host, profile, catalog, schema, warehouse_id

databricks bundle deploy --target dev
databricks bundle run deploy_wiki_store --target dev

Idempotent. Creates the schema, Delta tables, Vector Search index, 8 UC functions, and the daily curate Lakeflow Job.

2. Install the plugin (~1 minute)

In any Claude Code session:

/plugin marketplace add https://github.com/philtief/wikibricks.git
/plugin install wikibricks-recorder@wikibricks

Then once per machine:

uvx --from "git+https://github.com/philtief/wikibricks.git@v0.3.1" \
    wiki-init personal      # | team-create | team-join

That's it. Open a new Claude Code session and the recorder writes one page per session into sessions/<user>/YYYY/MM/DD/<sid>. Five MCP tools (wiki_search, wiki_read_full, wiki_index, wiki_write_page, wiki_promote_answer) appear automatically so the agent can search and read your prior sessions. Plugin details: plugin/README.md.
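The session path convention can be sketched as a small helper. `build_session_path` is a hypothetical name for illustration, not part of the recorder's API; only the `sessions/<user>/YYYY/MM/DD/<sid>` layout comes from the plugin's behavior:

```python
from datetime import date

def build_session_path(user: str, day: date, session_id: str) -> str:
    """Build a page path following the recorder's sessions/<user>/YYYY/MM/DD/<sid> layout."""
    return f"sessions/{user}/{day.year:04d}/{day.month:02d}/{day.day:02d}/{session_id}"

print(build_session_path("alice", date(2025, 3, 7), "a1b2c3"))
# sessions/alice/2025/03/07/a1b2c3
```

Zero-padded date segments keep the paths lexicographically sortable, which is why listing a user's session subtree returns sessions in chronological order.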

Why a wiki, not a log

Agents forget. Context windows are not memory, pasted docs are not memory, embeddings alone are not memory. What you actually want:

  • pages with stable paths and titles
  • typed links between them (not a generic related_to bag)
  • history — who wrote what, when, and what changed
  • search by meaning or keyword
  • growth — the store gets better as the agent answers more questions

WikiBricks delivers all five on managed Databricks services. No bespoke vector DB, no separate MCP server, no model dependency inside the library core — the agent calling the wiki is the only LLM in the loop.
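One way to picture the page-and-edge model those bullets describe. These dataclasses are illustrative only, assumed for this sketch rather than taken from the library's actual schema:

```python
from dataclasses import dataclass

@dataclass
class Page:
    path: str        # stable path, e.g. "topics/vector-search"
    title: str
    version: int = 1 # history: every write bumps the version

@dataclass
class Edge:
    src: str                # source page path
    dst: str                # destination page path
    link_type: str          # typed, e.g. "cites", rather than a generic related_to bag
    confidence: float = 1.0 # curation confidence for proposed edges
    origin: str = "agent"   # who proposed it, e.g. "agent" or the curate job

e = Edge("promoted/index-modes", "topics/vector-search", "cites", 0.91, "curate")
```

Making `link_type` a first-class field is what lets the curate job and the agent reason about specific relationships (a `cites` edge is actionable; a bare "related" edge is not).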

The maintenance loop

Every deployment ships one Lakeflow Job (wikibricks_curate) that runs daily, with three tasks. The first is the contract; the other two are opt-in.

  1. curate (LLM-free). Proposes new typed edges via Vector Search nearest-neighbor + exact-title matching, tagged with confidence + origin. Auto-commits anything above auto_commit_threshold=0.85; leaves the rest for the agent. Runs lint (orphans, stale pages, duplicates, broken links), deterministic link repair, and flags pages oversize / empty / ok.
  2. segregate (LLM-driven, opt-in). Picks up oversize pages, splits each into a parent (summary + ToC) plus N chunk children. Reassembly via fn_wiki_read_full(parent_path). Drop the task to run fully LLM-free.
  3. promote (LLM-driven, opt-in). Mines agent session traces, clusters recurring questions, synthesizes one canonical answer per cluster, scores it with an LLM judge, and writes passing clusters to promoted/<slug> with cites edges back to source pages. This is what makes the wiki grow from agent traces.

Every write, promote, and index sync appends to wiki_log so operators can watch the pipeline. scripts/diagnose_traces.py --window-days 7 summarizes trace volume, cluster eligibility, and recent events.
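The curate task's commit rule reduces to a pure filter. The dict shape and field names below are assumptions for illustration; only the 0.85 threshold comes from the job's `auto_commit_threshold` setting:

```python
AUTO_COMMIT_THRESHOLD = 0.85  # mirrors the job's auto_commit_threshold setting

def partition_candidates(candidates: list[dict]) -> tuple[list[dict], list[dict]]:
    """Split proposed edges: auto-commit high-confidence ones, leave the rest for the agent."""
    auto = [c for c in candidates if c["confidence"] >= AUTO_COMMIT_THRESHOLD]
    review = [c for c in candidates if c["confidence"] < AUTO_COMMIT_THRESHOLD]
    return auto, review

auto, review = partition_candidates([
    {"dst": "topics/delta", "confidence": 0.92, "origin": "ann"},
    {"dst": "topics/uc", "confidence": 0.71, "origin": "title_match"},
])
# auto contains the 0.92 edge; review contains the 0.71 edge
```

Keeping the threshold as a single tunable means a deployment can be made fully conservative (threshold 1.0 routes everything to the agent) without touching the proposal logic.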

Library API

from wikibricks import WikiClient

wiki = WikiClient(warehouse_id="abc123")

wiki.write_page("topics/vector-search", title="Vector Search",
                content={"summary": "...", "body": "..."}, tags=["retrieval"])

# Agent-in-the-loop: WikiBricks proposes, agent decides.
candidates = wiki.propose_edges("topics/vector-search", min_similarity=0.70)
wiki.commit_edges([c for c in candidates if my_agent_approves(c)])

wiki.search("what index modes exist", mode="HYBRID")    # HYBRID / ANN / FULL_TEXT
wiki.graph_neighbors("topics/vector-search", depth=2)
wiki.history("topics/vector-search")

# Promote a Q&A pair into a canonical synthesis page (cites every source).
wiki.promote_answer(query="...", answer="...",
                    source_pages=[wiki.read_page("topics/vector-search")])

Full surface: src/wikibricks/client.py. For agent runtimes that can't speak Python directly, use make_agent_tools(...) to get tool-callable equivalents of the write methods.
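The idea behind tool-callable wrappers can be sketched generically. This is not `make_agent_tools`'s real signature (that lives in src/wikibricks/client.py); it only illustrates turning an object's methods into name-plus-parameters tool specs, with a stand-in class in place of WikiClient:

```python
import inspect

def describe_tools(obj: object, method_names: list[str]) -> list[dict]:
    """Build simple tool specs (name, parameter list, callable) from an object's methods.
    Illustrative sketch only, not the library's make_agent_tools."""
    specs = []
    for name in method_names:
        fn = getattr(obj, name)  # bound method, so 'self' is already excluded
        specs.append({
            "name": name,
            "parameters": list(inspect.signature(fn).parameters),
            "call": fn,
        })
    return specs

class FakeWiki:
    """Stand-in for WikiClient's write surface."""
    def write_page(self, path, title, content): ...
    def commit_edges(self, edges): ...

specs = describe_tools(FakeWiki(), ["write_page", "commit_edges"])
# specs[0] -> {"name": "write_page", "parameters": ["path", "title", "content"], ...}
```

An agent runtime can then render each spec as a tool definition and dispatch approved calls through `spec["call"]`, which is the shape of the bridge for runtimes that cannot speak Python directly.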

MCP tools

Eight UC functions auto-exposed via Databricks managed MCP at https://<workspace>/api/2.0/mcp/functions/<catalog>/<schema> (OAuth, unity-catalog scope, UC permissions enforced):

Tool                                    Description
fn_wiki_search(question, num_results)   HYBRID Vector Search over pages
fn_wiki_read(page_path)                 Read a page by path
fn_wiki_read_full(parent_path)          Read parent + chunk children
fn_wiki_history(page_path)              Full version history
fn_wiki_log(num_entries)                Recent operation log
fn_wiki_index()                         Page catalog
fn_wiki_schema()                        Conventions (page types, link types, tags)
fn_wiki_write_help()                    How to write good wiki pages

Pass --var="enabled_uc_functions=fn_wiki_search,fn_wiki_read_full,..." to bundle deploy to expose a subset — useful for read-only agents or weaker models that get distracted by long tool lists.

DML (writes) is exposed separately via make_agent_tools(...) — UC SQL functions can't perform writes.

Development

uv sync                              # core library
uv sync --extra recorder             # also install the recorder package
uv run pytest                        # 480 tests, no workspace needed
uv run ruff check src tests scripts
uv build                             # → dist/wikibricks-0.3.1-py3-none-any.whl

For the recorder, see plugin/README.md. For deeper deploy customization (custom seed corpora, app env vars, ad-hoc overrides), see docs/ and the bundle config in databricks.yml. Coding agents should read AGENTS.md for repo conventions, hard rules, release checklist, and the dev → public sync checklist.

What this is not

  • Not a multi-hop QA system — the agent does the reasoning.
  • Not a vector DB product — the index is Databricks Vector Search.
  • Not a SaaS — a Databricks Asset Bundle that deploys into your workspace.
  • Not scratch memory — per-session conversation state belongs elsewhere.

License

Apache 2.0 — see LICENSE.
