Researcher

An AI-powered deep research agent built on the Google Gemini Deep Research API.

Conduct autonomous multi-step research tasks and generate comprehensive, cited reports with references to academic papers, code repositories, and other resources.

Features

Three Research Modes: Undirected, Directed, or No-Research based on your materials
Autonomous Research: Multi-step research executed autonomously by Gemini
Structured Citations: Automatic extraction of arXiv IDs, DOIs, GitHub URLs
Streaming Progress: Real-time updates with agent thinking steps
Follow-up Questions: Continue research with contextual follow-ups
Configurable Prompts: All prompts externalized to configs/prompts.yaml

Quick Start

# Install dependencies
uv sync

# Set API key (get one from https://aistudio.google.com/)
export GOOGLE_API_KEY="your-api-key"

# Run research
researcher research "What are the latest advances in quantum computing?"

# Output:
# 🔍 Researching: What are the latest advances in quantum computing?
# 📁 Output: ./output/research
#
# ✓ Research completed!
# Duration: 245.3s
# Report: ./output/research/research_report.md

Installation

git clone <repo-url>
cd researcher
uv sync                    # Production
uv sync --extra dev        # Development

Research Modes

Choose how the agent conducts research based on available materials:

🌐 UNDIRECTED (Default)

Web-first discovery - Agent searches the web autonomously

researcher research "Latest AI trends" --mode undirected

Use when: You want comprehensive web-based research on a topic

📚 DIRECTED

User materials + web - Prioritize your materials, use web to fill gaps

researcher research "Compare these approaches" \
  --mode directed \
  -a paper1.pdf \
  -a https://arxiv.org/abs/2301.12345 \
  -a "Key findings from experiment X"

Use when: You have materials but need additional context or verification
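
The directed example above passes three kinds of value to -a: a local file, a URL, and a plain-text note. How the tool tells them apart is not documented here, so the following is only a sketch of one plausible heuristic. The function name classify_artifact and its rules are assumptions, not part of the researcher package.

```python
from urllib.parse import urlparse
from pathlib import PurePath


def classify_artifact(artifact: str) -> str:
    """Return "url", "file", or "text" for one -a value (hypothetical heuristic)."""
    parsed = urlparse(artifact)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"
    # Assumption: a value with a file suffix and no spaces is a path.
    if PurePath(artifact).suffix and " " not in artifact:
        return "file"
    # Everything else is treated as an inline note.
    return "text"


for value in ("paper1.pdf", "https://arxiv.org/abs/2301.12345", "Key findings from experiment X"):
    print(value, "->", classify_artifact(value))
```

A real implementation would probably also check whether the path exists on disk; that step is left out here to keep the sketch free of filesystem access.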

🔬 NO-RESEARCH

Analysis only - Deeply analyze provided materials, no web search

researcher research "Synthesize findings across papers" \
  --mode no-research \
  -a paper1.pdf \
  -a paper2.pdf \
  -a paper3.pdf

Use when: You have all needed materials and want focused analysis
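
The three "Use when" rules above amount to a small decision table. As a rough sketch, it can be written as a helper that picks a mode and assembles the matching command line. The helper names suggest_mode and build_command are hypothetical; only the researcher research subcommand and the --mode and -a flags come from this README.

```python
def suggest_mode(has_materials: bool, materials_sufficient: bool = False) -> str:
    """Map the "Use when" guidance to a --mode value."""
    if not has_materials:
        return "undirected"   # nothing to start from: search the web
    if materials_sufficient:
        return "no-research"  # everything needed is already provided
    return "directed"         # materials first, web to fill gaps


def build_command(query: str, artifacts: list[str], materials_sufficient: bool = False) -> list[str]:
    """Assemble an argument list suitable for subprocess.run (not executed here)."""
    mode = suggest_mode(bool(artifacts), materials_sufficient)
    command = ["researcher", "research", query, "--mode", mode]
    for artifact in artifacts:
        command += ["-a", artifact]
    return command


print(build_command("Compare these approaches", ["paper1.pdf"]))
```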

Configuration

API Key

Set your Google API key using any of the following, listed from highest to lowest priority:

CLI: --api-key "your-key"
Environment: export GOOGLE_API_KEY="your-key"
File: .env or configs/research.yaml
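
The priority order above can be illustrated with a short lookup function. This is a minimal sketch, not the package's actual code: resolve_api_key is a hypothetical name, and it assumes the file-based sources have already been parsed into a mapping that uses the same GOOGLE_API_KEY key.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    cli_key: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
    file_values: Optional[Mapping[str, str]] = None,
) -> Optional[str]:
    """Return the first non-empty key: CLI flag, then environment, then file."""
    env = os.environ if env is None else env
    file_values = {} if file_values is None else file_values
    candidates = (
        cli_key,                            # --api-key
        env.get("GOOGLE_API_KEY"),          # exported variable
        file_values.get("GOOGLE_API_KEY"),  # .env or configs/research.yaml (key name assumed)
    )
    for candidate in candidates:
        if candidate:
            return candidate
    return None
```

Passing env and file_values explicitly keeps the function easy to test without touching the real environment.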

Prompts

All prompts are externalized to configs/prompts.yaml for easy customization.

Available prompts:

default_output_format - Citation format and reference structure for main research
follow_up_system_prompt - System instructions for follow-up questions
research_mode_undirected - Web-first discovery instructions
research_mode_directed - Guided research with user materials prioritization
research_mode_no_research - Analysis-only mode instructions

To customize:

# Edit prompts without touching code
nano configs/prompts.yaml

# Changes apply to the next run
researcher research "Your query"

Benefits:

✅ No code editing required
✅ A/B test different prompt formats
✅ Version control your prompts
✅ Single source of truth
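
The three research_mode_* prompt names listed above match the --mode values with hyphens replaced by underscores. Assuming the lookup works along those lines (the package's real logic is not shown in this README), a sketch of selecting the prompt for a mode could look like this. The prompts argument stands in for the parsed contents of configs/prompts.yaml, for example the result of yaml.safe_load; both function names are hypothetical.

```python
def prompt_key_for_mode(mode: str) -> str:
    """Turn a --mode value into the matching key in configs/prompts.yaml."""
    return "research_mode_" + mode.replace("-", "_")


def select_prompt(prompts: dict[str, str], mode: str) -> str:
    """Look up the mode prompt and fail loudly if the YAML lacks it."""
    key = prompt_key_for_mode(mode)
    if key not in prompts:
        raise KeyError(f"configs/prompts.yaml has no entry named {key!r}")
    return prompts[key]


# Stand-in for the parsed YAML file; the prompt text is a placeholder.
example_prompts = {"research_mode_no_research": "Analyze only the provided materials."}
print(select_prompt(example_prompts, "no-research"))
```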

Usage

CLI

# Basic research (undirected mode)
researcher research "Your research query"

# Directed mode with materials
researcher research "Compare these architectures" \
  --mode directed \
  -a paper1.pdf \
  -a https://arxiv.org/abs/... \
  -o ./my_research

# No-research mode (analysis only)
researcher research "Synthesize findings" \
  --mode no-research \
  -a doc1.pdf -a doc2.pdf -a doc3.pdf \
  -v

# Common options:
#   --mode MODE          Research mode: undirected, directed, no-research
#   -a, --artifacts      Supporting materials (can use multiple times)
#   -o, --output DIR     Output directory (default: ./output)
#   -v, --verbose        Show thinking steps and preview
#   --format TEXT        Custom format instructions
#   --no-stream          Use polling instead of streaming
#   --max-wait SECONDS   Max wait time (default: 3600)
#   --api-key KEY        Google API key

# Get help
researcher --help
researcher research --help
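
The --no-stream and --max-wait options describe a polling loop with a time limit. The sketch below shows the general shape of such a loop under stated assumptions: poll_until_done, the check_status callable, the status strings, and the 10-second interval are all hypothetical and are not taken from the researcher package or the Gemini API.

```python
import time
from typing import Callable


def poll_until_done(
    check_status: Callable[[], str],
    max_wait: float = 3600.0,  # mirrors the documented --max-wait default
    interval: float = 10.0,    # assumed polling interval
) -> str:
    """Call check_status until it reports a final state or the time limit passes."""
    deadline = time.monotonic() + max_wait
    while True:
        status = check_status()
        if status in ("completed", "failed"):  # assumed final states
            return status
        # Stop before sleeping if the next check would land past the deadline.
        if time.monotonic() + interval > deadline:
            raise TimeoutError(f"research did not finish within {max_wait} seconds")
        time.sleep(interval)
```

Using time.monotonic rather than time.time keeps the deadline correct even if the system clock is adjusted during a long run.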

Programmatic

import asyncio

from researcher import DeepResearcher, ResearchConfig, ResearchMode


async def main() -> None:
    # Basic usage (undirected mode)
    researcher = DeepResearcher()
    result = await researcher.research("What is quantum computing?")
    print(result.report)

    # Directed mode with artifacts
    config = ResearchConfig(
        mode=ResearchMode.DIRECTED,
        artifacts=[
            "paper1.pdf",
            "https://arxiv.org/abs/2301.12345",
            "Key findings from experiment X",
        ],
        enable_streaming=True,
    )
    researcher = DeepResearcher(config=config)
    result = await researcher.research("Compare these transformer architectures")

    # No-research mode (analysis only)
    config = ResearchConfig(
        mode=ResearchMode.NO_RESEARCH,
        artifacts=["paper1.pdf", "paper2.pdf", "paper3.pdf"],
        output_format="Include comparison tables",
    )
    researcher = DeepResearcher(config=config)
    result = await researcher.research("Synthesize findings across these papers")

    # Save results
    result.save("./output/research")

    # Follow-up questions reuse the interaction ID of the most recent result
    answer = await researcher.follow_up(
        "Can you elaborate on error correction?",
        result.interaction_id,
    )
    print(answer)


# research() and follow_up() are coroutines, so run them inside an event loop
asyncio.run(main())

Convenience Function

import asyncio

from researcher import deep_research, ResearchMode


async def main() -> None:
    # Quick undirected research
    result = await deep_research("Latest AI trends")

    # With custom config
    result = await deep_research(
        "Latest AI trends",
        mode=ResearchMode.DIRECTED,
        artifacts=["report.pdf", "https://arxiv.org/abs/..."],
        output_format="Executive summary with bullet points",
    )
    print(result.report)


asyncio.run(main())

Output

What You Get

When you run a research query, you'll see:

🔍 Researching: What are the latest advances in quantum computing?
📁 Output: ./output/research

# Research progress (if --verbose)...

✓ Research completed!
Duration: 245.3s
Report: ./output/research/research_report.md

File Structure

output/research/
├── research_report.md       # Main report with structured citations
├── research_metadata.json   # Query, timing, interaction ID
└── thinking_steps.md        # Agent reasoning (only with -v flag)
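
To work with these files from a script, one option is to read whichever of them exist. The sketch below relies only on the three file names shown above; load_research_output is a hypothetical helper, and it makes no assumption about the field names inside research_metadata.json.

```python
import json
from pathlib import Path

EXPECTED_FILES = ("research_report.md", "research_metadata.json", "thinking_steps.md")


def load_research_output(output_dir: str) -> dict:
    """Read the documented output files that are present in output_dir."""
    base = Path(output_dir)
    found = {}
    for name in EXPECTED_FILES:
        path = base / name
        if not path.is_file():
            continue  # e.g. thinking_steps.md is only written with -v
        text = path.read_text(encoding="utf-8")
        # Parse the metadata file as JSON; keep the markdown files as plain text.
        found[name] = json.loads(text) if path.suffix == ".json" else text
    return found
```

Calling load_research_output("./output/research") after a run returns a dictionary keyed by file name, with the metadata already parsed.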

Report Contents

Reports include comprehensive analysis with structured references organized by type:

Category	Identifiers Included
Published Papers	DOI, full citation, venue
Preprints	arXiv ID, bioRxiv, SSRN
Code Repositories	GitHub URLs, stars, description
Datasets	Hugging Face, Kaggle, format info
Websites & Docs	URLs, access dates
Books & Textbooks	ISBN, publisher, year
Videos	YouTube URLs, duration

All references are formatted as markdown tables with YAML metadata for easy programmatic extraction.
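
As an illustration of programmatic extraction, the sketch below pulls arXiv IDs, DOIs, and GitHub URLs out of report text with regular expressions. It is an assumption-laden approximation, not the package's extractor: the patterns and the extract_identifiers name are hypothetical, the arXiv pattern covers only new-style IDs that carry an arXiv: prefix or an arxiv.org/abs/ URL, and trailing sentence punctuation is stripped from matches.

```python
import re

ARXIV_RE = re.compile(r"(?:arXiv:|arxiv\.org/abs/)(\d{4}\.\d{4,5})", re.IGNORECASE)
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s\"<>|)]+")
GITHUB_RE = re.compile(r"https?://github\.com/[\w.-]+/[\w.-]+")


def _clean(matches: list[str]) -> list[str]:
    """Drop trailing sentence punctuation, remove duplicates, and sort."""
    return sorted({match.rstrip(".,;") for match in matches})


def extract_identifiers(report: str) -> dict[str, list[str]]:
    """Collect the identifier types named in the Features list."""
    return {
        "arxiv": _clean(ARXIV_RE.findall(report)),
        "doi": _clean(DOI_RE.findall(report)),
        "github": _clean(GITHUB_RE.findall(report)),
    }


sample = (
    "See arXiv:2301.12345 and https://arxiv.org/abs/2301.12345v2, "
    "DOI 10.1000/xyz123, code at https://github.com/example/repo."
)
print(extract_identifiers(sample))
```

The sample string and its identifiers are placeholders chosen for the demonstration.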

Development

# Run tests
uv run pytest
uv run pytest --cov=researcher

# Code quality
uv run ruff check src/
uv run ruff format src/
uv run mypy src/

Requirements

Python 3.11+
Google API key with Gemini access
Dependencies: google-genai, click, rich, pyyaml

Architecture

Uses Google's Gemini Deep Research Agent (deep-research-pro-preview-12-2025) via the Interactions API with:

Background execution for long-running tasks
Streaming for real-time progress updates
Automatic stream reconnection handling
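
Stream reconnection generally means remembering the last event received and asking the server to resume from there after a dropped connection. The sketch below shows that pattern in isolation. Everything in it is hypothetical: open_stream stands for whatever call opens the event stream, the (event_id, text) event shape is assumed, and the retry limit is arbitrary. It does not describe the actual Interactions API or this package's internals.

```python
from typing import Callable, Iterable, Iterator, Optional, Tuple

Event = Tuple[str, str]  # (event_id, text); this shape is an assumption


def stream_with_reconnect(
    open_stream: Callable[[Optional[str]], Iterable[Event]],
    max_reconnects: int = 3,
) -> Iterator[Event]:
    """Yield events, reopening the stream after the last seen event on a drop."""
    last_event_id: Optional[str] = None
    reconnects = 0
    while True:
        try:
            for event_id, text in open_stream(last_event_id):
                last_event_id = event_id  # remember where to resume
                yield event_id, text
            return  # the stream ended normally
        except ConnectionError:
            reconnects += 1
            if reconnects > max_reconnects:
                raise  # give up and surface the error to the caller
```

Because the resume point is tracked per event, a reconnect does not replay events the caller has already seen, provided the server honors the resume position.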

License

MIT

SIslamMun/Researcher
