knowledge-extractor

Interviews engineers and tech leads to extract design intent, business logic, and operational knowledge from code. It works with any content (docs, specs, runbooks) but is built code-first.

IP: Killawot Limited | License: MIT

The Problem

Business knowledge lives in people's heads. When someone asks "why does X work this way?", the answer requires:

1. Reading code to understand what happens
2. Asking the SME to understand why it was designed that way
3. Documenting the answer so it doesn't have to be asked again

This tool automates steps 1 and 3, and structures step 2 into an efficient interview.

How It Works

1. Point it at a folder (code, docs, or both)
2. It scans the content and generates a topic-by-topic interview plan
3. You answer questions in Claude Code (voice-optimized via Handy/Dragon)
4. It drafts structured markdown documentation in real time
5. You review and approve each doc

Tools (MCP)

Tool	Description
`prepare_interview`	Scan folder → generate interview plan + session ID
`conduct_interview`	Run structured Q&A for each topic
`generate_documentation`	Produce final markdown docs from transcript
`review_documentation`	Approve or request changes per doc

Installation

npm install
npm run build

Add to your Claude Code MCP config (~/.claude/settings.json):

{
  "mcpServers": {
    "knowledge-extractor": {
      "command": "node",
      "args": ["/path/to/knowledge-extractor/dist/index.js"],
      "env": {
        "ANTHROPIC_API_KEY": "your-key-here",
        "KNOWLEDGE_EXTRACTOR_IDE": "cursor"
      }
    }
  }
}

KNOWLEDGE_EXTRACTOR_IDE controls the editor link format in interview questions. Valid values: cursor (default), vscode, none.
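
As a rough illustration of how the three values could map to link formats, here is a minimal sketch. The names `parseIde` and `editorLink`, the fallback to `cursor` for unrecognised values, and the plain `path:line` output for `none` are all assumptions made for this example; the link format knowledge-extractor actually emits may differ.

```typescript
// Hypothetical helpers: not part of knowledge-extractor's documented API.
type Ide = "cursor" | "vscode" | "none";

// Parse the env value, falling back to the documented default ("cursor").
// Treating unknown values as the default is an assumption.
function parseIde(value: string | undefined): Ide {
  if (value === "cursor" || value === "vscode" || value === "none") {
    return value;
  }
  return "cursor";
}

// Build a reference to a file and line for the chosen editor.
// Assumes an absolute POSIX path (starting with "/").
function editorLink(ide: Ide, absPath: string, line: number): string {
  if (ide === "none") {
    return `${absPath}:${line}`; // plain text, no URL scheme
  }
  return `${ide}://file${encodeURI(absPath)}:${line}`;
}

console.log(editorLink(parseIde("vscode"), "/repo/src/sync.ts", 42));
```

Keeping the parsing separate from the link building means a mistyped value degrades to a working default instead of producing a broken link.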

Usage

1. Prepare interview

prepare_interview(
  source_path: "/path/to/my-api/src",
  source_type: "code",
  output_path: "/path/to/output/docs",
  focus_areas: ["versioning", "filtering", "sync"]
)

Returns a session ID and interview plan with estimated duration per topic.
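
The exact response shape is not documented here, so the following is only a sketch of what a plan with per-topic estimates might look like and how a caller could budget a session from it. Every field name (`sessionId`, `topics`, `estimatedMinutes`, `questions`) and the sample data are hypothetical.

```typescript
// Hypothetical shapes: field names are assumptions for illustration only.
interface PlannedTopic {
  title: string;
  estimatedMinutes: number;
  questions: string[];
}

interface InterviewPlan {
  sessionId: string;
  topics: PlannedTopic[];
}

// Sum the per-topic estimates to budget the whole session.
function totalMinutes(plan: InterviewPlan): number {
  return plan.topics.reduce((sum, topic) => sum + topic.estimatedMinutes, 0);
}

// Made-up example data, not real tool output.
const examplePlan: InterviewPlan = {
  sessionId: "example-session",
  topics: [
    { title: "versioning", estimatedMinutes: 8, questions: ["Why are versions immutable?"] },
    { title: "filtering", estimatedMinutes: 5, questions: ["Who owns the filter rules?"] },
  ],
};

console.log(`${totalMinutes(examplePlan)} min across ${examplePlan.topics.length} topics`);
```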

2. Conduct interview

conduct_interview(
  session_id: "<session-id>",
  auto_draft: true,
  read_back: true
)

The agent asks the prepared questions one at a time. Answer each one, or say "skip" to skip a question and "done" to finish the topic.
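
The skip/done handling can be pictured as a small loop over one topic's questions. This is a hypothetical sketch, not the tool's implementation: `runTopic`, `TopicResult`, and the case-insensitive matching of the two commands are assumptions.

```typescript
// Hypothetical sketch of skip/done handling for a single topic.
interface TopicResult {
  answered: { question: string; answer: string }[];
  skipped: string[];
}

function runTopic(questions: string[], replies: string[]): TopicResult {
  const result: TopicResult = { answered: [], skipped: [] };
  for (let i = 0; i < questions.length; i++) {
    const question = questions[i];
    const raw = replies[i];
    if (question === undefined || raw === undefined) break; // ran out of replies
    const reply = raw.trim();
    const command = reply.toLowerCase();
    if (command === "done") break; // finish the topic early
    if (command === "skip") {
      result.skipped.push(question); // leave this question unanswered
      continue;
    }
    result.answered.push({ question, answer: reply });
  }
  return result;
}

const demo = runTopic(["q1", "q2", "q3", "q4"], ["first answer", "skip", "done", "ignored"]);
console.log(demo.answered.length, demo.skipped);
```

In this sketch, questions after a "done" are neither answered nor recorded as skipped, so a later pass could still pick them up.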

3. Generate docs

generate_documentation(
  session_id: "<session-id>"
)

Produces markdown files in the output_path for each topic.

4. Review

review_documentation(
  session_id: "<session-id>",
  doc_path: "/path/to/output/docs/TOPIC_WHY.md",
  feedback: "Add that REVIEWED→PUBLISHED requires final approval"
)

Omit feedback to approve as-is.
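
The approve-or-revise rule can be sketched as a function over the request. The parameter names mirror the call above; the `ReviewOutcome` type, the `resolveReview` name, and the choice to treat blank feedback the same as omitted feedback are assumptions for illustration.

```typescript
// Hypothetical shapes; only session_id, doc_path, and feedback come from the call above.
interface ReviewRequest {
  session_id: string;
  doc_path: string;
  feedback?: string;
}

type ReviewOutcome =
  | { status: "approved"; doc_path: string }
  | { status: "changes_requested"; doc_path: string; feedback: string };

// No feedback means approval; any non-empty feedback requests changes.
function resolveReview(req: ReviewRequest): ReviewOutcome {
  const feedback = req.feedback?.trim();
  if (!feedback) {
    return { status: "approved", doc_path: req.doc_path };
  }
  return { status: "changes_requested", doc_path: req.doc_path, feedback };
}

console.log(resolveReview({ session_id: "s1", doc_path: "/docs/TOPIC_WHY.md" }).status);
```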

Voice Input

Interviews are designed for voice — speak answers naturally using:

Handy (Mac) — real-time transcription into Claude Code chat
Dragon — professional STT
macOS dictation — built-in, Fn+Fn to activate

The voice layer is external — knowledge-extractor works with anything that types into a text input.

Why voice: A 60-90 min typed interview becomes 15-20 min spoken with higher detail and natural flow.

Document Templates

Template	Use For
`WHY`	Design decisions, architecture rationale
`HOW`	Step-by-step processes, algorithms
`RULES`	Business rule catalogs
`FLOW`	User journeys, workflows, sequence diagrams

Templates are in src/templates/ and can be customised.
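
The review example's path (`TOPIC_WHY.md`) suggests output files are named by topic and template. Assuming that convention holds, a file name could be derived as in the sketch below; `docFileName` and the slug rules are hypothetical and may not match what the tool writes.

```typescript
// The four template names come from the table above.
type Template = "WHY" | "HOW" | "RULES" | "FLOW";

// Hypothetical naming helper: upper-case the topic, collapse anything
// that is not A-Z or 0-9 into underscores, then append the template name.
function docFileName(topic: string, template: Template): string {
  const slug = topic
    .trim()
    .toUpperCase()
    .replace(/[^A-Z0-9]+/g, "_")
    .replace(/^_+|_+$/g, "");
  return `${slug}_${template}.md`;
}

console.log(docFileName("status transitions", "RULES"));
```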

Local Dev

# Run test harness against this repo's own source
node --import tsx test-agent.ts

# Run against a specific folder
node --import tsx test-agent.ts /path/to/source /path/to/output

Roadmap

Phase 2: Claude-powered question generation from file contents
Phase 2: Claude-powered documentation synthesis from transcript
Phase 3: Markdown/PDF scanner improvements
Phase 4: CI + public release
Phase 5: cloud.killawot.ai hosted version
