Insight Engine

A true AI agent that synthesizes qualitative research transcripts into structured insights reports.

github.com/rackumar21/insight-engine

Why this is an agent, not just a script

Most "AI tools" make one API call: you give it input, it gives you output.

Insight Engine is different. It uses Claude's tool_use API to work step by step — planning its own analysis, reading files autonomously, checkpointing as it goes, and synthesizing across everything at the end. Claude decides the order of operations. If interrupted, it resumes from where it left off.

The agentic loop:

List files → Read each transcript → Extract + save themes → Load all themes → Synthesize → Write report

Each arrow is a tool call. Claude drives the process, not a hardcoded script.

What it does

Given a folder of interview or research transcripts, it produces:

Executive summary — 3-5 bullets for someone who has 30 seconds
Key themes — grouped by topic, with supporting quotes and participant counts
Tensions and contradictions — where participants disagreed (often the most interesting signals)
Underexplored areas — questions worth asking in follow-up research
Methodology notes — how many transcripts, any gaps

Stack


Language	Python 3.11+
AI	Anthropic Claude API (`claude-opus-4-6`)
Tool use	Claude's native tool_use API

Setup

git clone https://github.com/rackumar21/insight-engine
cd insight-engine
pip install -r requirements.txt
export ANTHROPIC_API_KEY=your_key_here

Usage

# Run on sample transcripts (included)
python agent.py --folder transcripts/sample

# Run on your own transcripts
python agent.py --folder path/to/your/transcripts

# Specify output location
python agent.py --folder transcripts/ --output my_report.md

Reports are saved to ./reports/insights_YYYY-MM-DD.md by default.

Resumability

Per-transcript theme extractions are cached in ./analysis/. If the agent is interrupted mid-run, re-running it skips already-analyzed transcripts and picks up from where it left off.

What I learned

Agents need checkpoints. A script that fails halfway through loses all its work. An agent that saves intermediate state can resume. The analysis/ cache was the single most important design decision.
Tool design is product design. Writing the tool schemas felt exactly like writing a PRD: define the interface, anticipate edge cases, make the inputs unambiguous. Claude's ability to use a tool correctly depends almost entirely on how well the schema describes what it does.
The reasoning is the output. Watching the agent work — seeing it decide to re-read a transcript before synthesizing, or group themes before writing — is more interesting than the report. The process reveals something about how good analysis actually works.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
transcripts/sample		transcripts/sample
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Insight Engine

Why this is an agent, not just a script

What it does

Stack

Setup

Usage

Resumability

What I learned

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Insight Engine

Why this is an agent, not just a script

What it does

Stack

Setup

Usage

Resumability

What I learned

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages