An open-source template for building LLM-powered knowledge bases, following Andrej Karpathy's "LLM Wiki" pattern.
You provide raw sources. The LLM reads them, writes structured wiki pages, cross-links everything, and maintains it over time. You never edit the wiki directly — you curate sources and ask questions.
The system has three layers:
raw/ Sources you collect (articles, transcripts, notes, PDFs)
wiki/ LLM-written & maintained pages (summaries, concepts, entities, syntheses)
CLAUDE.md Schema that tells the LLM how to structure everything
Three operations drive the workflow:
| Operation | Trigger | What happens |
|---|---|---|
| Ingest | "ingest raw/my-source.txt" | LLM reads the source, creates a summary page, creates/updates concept and entity pages, adds cross-links, updates the index and log |
| Query | Ask any question | LLM searches the wiki, synthesizes an answer with citations, optionally creates a synthesis page for novel insights |
| Lint | "lint" or "health check" | LLM audits all pages for orphans, contradictions, missing links, incomplete sections, and low-confidence claims — fixes what it can, reports the rest |
-
Clone this repo
git clone https://github.com/YOUR_USERNAME/llm-wiki.git my-knowledge-base cd my-knowledge-base -
Customize CLAUDE.md for your domain
- Update the Purpose section with your topic
- Replace the placeholder tagging taxonomy with your own categories
- Adjust confidence level descriptions if needed
- Everything else (workflows, page formats, linking rules) works as-is
-
Drop sources into
raw/- Text files, transcripts, articles, notes — any plain text
- These are immutable once added; the LLM never modifies them
-
Tell the LLM to ingest
ingest raw/my-first-source.txtThe LLM will create summary pages, concept pages, entity pages, cross-links, and update the index.
-
Ask questions
What are the key differences between X and Y?The LLM answers from the wiki, citing specific pages.
-
Run health checks
lintThe LLM audits the wiki and fixes issues.
.
├── CLAUDE.md # Schema — the LLM's instructions
├── raw/ # Your source documents (immutable)
└── wiki/
├── index.md # Master catalog of all pages
├── log.md # Append-only activity log
├── dashboard.md # Dataview dashboard (Obsidian)
├── analytics.md # Charts View analytics (Obsidian)
├── flashcards.md # Spaced repetition cards
├── summaries/ # One page per source document
├── concepts/ # Concept and framework pages
├── entities/ # People, tools, organizations, etc.
├── syntheses/ # Cross-cutting analyses and comparisons
├── journal/ # Research/session journal entries
│ └── template.md # Journal entry template
└── presentations/ # Marp slide decks
This template includes several extras beyond the core wiki pattern:
Live queries that surface low-confidence pages, recent updates, concepts by tag, and pages with the most sources. Requires the Dataview Obsidian plugin.
Visual analytics with pie charts, bar charts, and word clouds. Requires the Charts View Obsidian plugin.
Use Mermaid code blocks in any wiki page to create flowcharts, sequence diagrams, or concept maps. Native support in Obsidian and GitHub.
Create slide decks from markdown using Marp. Drop presentation files in this directory.
Track your research sessions, experiments, or applied work with the included template. The LLM can reference journal entries when answering queries.
Flashcards in the format used by the Spaced Repetition Obsidian plugin. Ask the LLM to generate flashcards from any wiki page.
This repo works with Claude Code's MCP server capabilities. Point an MCP-compatible client at this repo and the LLM can read/write the wiki programmatically.
The schema in CLAUDE.md is domain-agnostic. To adapt it:
- Purpose — Describe your knowledge domain in one paragraph
- Tagging taxonomy — Replace placeholder categories with your own (e.g., for a cooking KB:
cuisine,technique,ingredient,equipment) - Confidence levels — Adjust the descriptions to match your domain's evidence standards
- Entity types — Update the entity page description to match what entities mean in your domain (people, tools, companies, etc.)
- Journal template — Customize
wiki/journal/template.mdfor your workflow
Everything else — page format, linking conventions, workflows, rules — is universal and works across domains.
This template works for any knowledge-intensive topic:
- Research notes — papers, experiments, methodologies
- Book analysis — themes, characters, author techniques
- Competitive analysis — companies, products, market trends
- Course notes — lectures, readings, key concepts
- Personal development — frameworks, habits, book summaries
- Technical documentation — APIs, architectures, design patterns
- Hobby deep-dives — any subject you want to master
MIT