ResearchPilot

AI deep research agent that plans, searches, verifies, and writes.

One query in. Structured, source-backed report out.

Getting Started · Architecture · Configuration · Roadmap

What It Does

You type a question. ResearchPilot builds a research plan, executes dozens of targeted web searches in parallel, scores every source for credibility, decides when to go deeper and when to stop, detects contradictions across findings, and assembles a structured report with inline citations.

No manual knobs. The AI determines scope based on your query — you just pick Quick, Thorough, or Deep.

How It Works

Query → Plan → Search → Extract → Assess → Repeat? → Report

Plan — A strategic LLM decomposes your query into 2-5 research aspects with sub-questions and priorities
Search — Parallel workers execute targeted queries across Firecrawl + Tavily
Extract — Findings are pulled from each source with claim, evidence, confidence, and source attribution
Assess — Every source gets a credibility score (domain heuristics + LLM assessment for unknowns)
Adapt — Information gain is measured after each round. Rich veins get explored deeper. Diminishing returns trigger early stopping
Detect — LLM-based contradiction detection flags conflicting claims across sources
Gap-fill — Coverage map identifies missing aspects, triggers up to 2 targeted follow-up rounds
Report — Outline → section-by-section generation → intro/conclusion → assembly with ranked sources

Architecture

flowchart LR
    Form["📝 Research Form\nQuick / Thorough / Deep"] --> Planner["🧠 Planner\nstrategic LLM"]
    Planner -->|research plan| W1["⚙️ Worker 1\naspect"] & W2["⚙️ Worker 2\naspect"]

    W1 & W2 -->|findings| State["📦 Research State\nimmutable"]
    State --> Pipeline["📄 Report Pipeline\noutline → sections → assemble"]
    Pipeline --> Report(["✅ Structured Report"])

    W1 & W2 -. search .-> Search["🔍 Firecrawl + Tavily"]
    W1 & W2 -. score .-> Cred["🏅 Credibility Scoring"]
    W1 & W2 -. gain .-> Gain["📊 Info Gain / Stop"]
    W1 & W2 -. LLM .-> Models["⚡ fast · smart · strategic"]

    Pipeline -. SSE stream .-> Form

Key Design Decisions

Immutable state — Research state is a pure data structure. Every mutation returns a new object. No side effects, easy to serialize, easy to debug.
Adaptive depth — Information gain is measured after each search round. High novelty → keep going. Low novelty → stop. No wasted API calls.
Tiered models — Fast/cheap models handle extraction (~70% of LLM calls). Smart models write the report. Strategic models plan. Cuts cost 40-60%.
Abort-aware pipeline — Client disconnect kills all in-flight LLM calls and searches via AbortSignal propagation. No orphaned API spend.
Structured outputs — Every LLM call uses Zod schemas via the Vercel AI SDK's Output.object. Type-safe from prompt to response.
Collect-then-merge — Parallel workers return results independently. The orchestrator merges them sequentially after all settle — no race conditions.

Getting Started

Prerequisites

Bun v1.2+
A Firecrawl API key (search + scraping)
An AI provider key: OpenRouter, OpenAI, or Groq

Install

git clone https://github.com/ajayyAI/researchpilot.git
cd researchpilot
bun install
cp .env.example .env

Configure

Edit .env with your API keys:

FIRECRAWL_API_KEY=fc-...        # required — web search + scraping
AI_PROVIDER=openrouter           # openai | openrouter | groq
OPENROUTER_API_KEY=sk-or-...    # key for your chosen provider

Run

bun dev

Open http://localhost:3000. Type a research topic. Pick an effort level. Go.

Configuration

Required

Variable	Description
`FIRECRAWL_API_KEY`	Firecrawl API key for web search + scraping
`AI_PROVIDER`	`openai`, `openrouter`, or `groq`
Provider key	`OPENAI_API_KEY`, `OPENROUTER_API_KEY`, or `GROQ_API_KEY`

Model Tiers (Optional)

Override the default model for each tier to optimize cost vs. quality:

Variable	Tier	Used For	Example
`AI_MODEL`	Default	Fallback for all tiers	`gpt-4.1`
`AI_MODEL_FAST`	Fast	Extraction, source scoring, gain assessment	`gpt-4.1-mini`
`AI_MODEL_STRATEGIC`	Strategic	Planning, query decomposition	`o4-mini`

Search Providers (Optional)

Variable	Description
`TAVILY_API_KEY`	Adds Tavily as a second search source
`TAVILY_SEARCH_DEPTH`	`basic` (default) or `advanced`
`FIRECRAWL_CONCURRENCY`	Parallel search limit (default: `2`, increase with paid plan)

Effort Levels

Users don't configure breadth or depth directly. They pick an effort level and the AI handles the rest:

Level	What Happens	Typical Time	Best For
Quick	2 parallel queries, 1 depth level	30-60s	Fact-checks, simple lookups
Thorough	4 parallel queries, 2 depth levels	2-4 min	Standard research
Deep	7 parallel queries, 3 depth levels	5-10 min	Complex multi-faceted analysis

Tech Stack

Layer	Technology
Framework	Next.js 16 with Turbopack
Language	TypeScript 5 (strict)
AI	Vercel AI SDK v6 with Zod structured outputs
Search	Firecrawl + Tavily (dual-source, deduped)
UI	React 19, Tailwind CSS 4, Radix UI
Streaming	Server-Sent Events with structured event types
Lint	Biome
Runtime	Bun

Project Structure

src/
├── app/
│   ├── api/research/
│   │   ├── route.ts              # SSE streaming endpoint
│   │   └── feedback/route.ts     # Clarification questions
│   └── page.tsx
├── components/
│   ├── home-page.tsx             # Main page — SSE client, state machine
│   ├── research-form.tsx         # Query input + effort selector
│   ├── research-plan.tsx         # Plan display card
│   ├── research-progress.tsx     # Live progress tracker + log
│   └── research-report.tsx       # Markdown report with copy/download
└── lib/research/
    ├── engine.ts                 # Orchestrator — parallel workers, gap analysis
    ├── workers.ts                # Aspect-level research with adaptive depth
    ├── planner.ts                # Research plan generation (strategic LLM)
    ├── report-pipeline.ts        # Outline → sections → assembly (smart LLM)
    ├── credibility.ts            # Source credibility scoring
    ├── gain.ts                   # Information gain assessment
    ├── search.ts                 # Multi-provider search with dedup + retry
    ├── state.ts                  # Immutable research state
    ├── providers.ts              # Tiered model selection
    ├── types.ts                  # Zod schemas + TypeScript types
    ├── prompts.ts                # System prompts
    └── text-utils.ts             # Token counting, text splitting

How It Compares

Feature	ResearchPilot	Deep-Research (Aomni)	GPT-Researcher	Perplexica
Research planning	AI-generated	None	Multi-agent	None
Adaptive depth	Information-gain driven	Fixed halving	Fixed	Fixed
Source credibility	Heuristic + LLM	None	None	None
Contradiction detection	LLM-based	None	None	None
Report pipeline	Outline → sections	Single pass	Multi-agent review	Single pass
Tiered models	fast / smart / strategic	Single	Three-tier	Single
Multi-source search	Firecrawl + Tavily	Firecrawl	Tavily	SearXNG
Streaming	SSE with typed events	SSE	WebSocket	WebSocket
Self-hosted	Yes	Yes	Yes	Yes

Roadmap

Development

bun dev              # Dev server with Turbopack
bun run type-check   # TypeScript type checking
bun run ci           # Biome lint + format check
bun run check        # Auto-fix lint/format issues
bun run build        # Production build
bun run validate     # type-check + lint + build

Contributing

Contributions welcome. Open an issue first to discuss what you'd like to change.

Fork the repo
Create a feature branch (git checkout -b feat/your-feature)
Run bun run validate before committing
Open a pull request

License

MIT

Built by ajayyAI

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github/workflows		.github/workflows
public		public
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
biome.json		biome.json
bun.lock		bun.lock
components.json		components.json
next.config.ts		next.config.ts
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ResearchPilot

What It Does

How It Works

Architecture

Key Design Decisions

Getting Started

Prerequisites

Install

Configure

Run

Configuration

Required

Model Tiers (Optional)

Search Providers (Optional)

Effort Levels

Tech Stack

Project Structure

How It Compares

Roadmap

Development

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ResearchPilot

What It Does

How It Works

Architecture

Key Design Decisions

Getting Started

Prerequisites

Install

Configure

Run

Configuration

Required

Model Tiers (Optional)

Search Providers (Optional)

Effort Levels

Tech Stack

Project Structure

How It Compares

Roadmap

Development

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages