OpenDeepResearcher-API

A REST API service for conducting deep research on any topic using AI. This project is an adaptation of Matt Shumer's research notebook into a modular API service.

Requirements

  • Python 3.10
  • Virtual environment (recommended)
  • Ollama (optional, for local LLM support)

Overview

OpenDeepResearcher-API is a research assistant that:

  • Generates intelligent search queries
  • Performs parallel web searches
  • Evaluates content relevance
  • Extracts key information
  • Synthesizes findings into comprehensive reports
  • Provides real-time status updates during research

System Architecture

graph TB
    Client[Client Application]
    API[FastAPI Server]
    RE[Research Engine]
    LLM[LLM Provider]
    Search[SERPAPI]
    Content[Jina AI]
    DB[(Research Outputs)]
    
    Client -->|HTTP/SSE| API
    API -->|Research Request| RE
    RE -->|Query Generation| LLM
    RE -->|Web Search| Search
    RE -->|Content Extraction| Content
    RE -->|Text Generation| LLM
    RE -->|Save Report| DB
    
    subgraph LLM Providers
        LLM -->|Default| OpenRouter
        LLM -->|Option 1| OpenAI
        LLM -->|Option 2| Anthropic
        LLM -->|Option 3| Ollama
    end

Research Process Flow

sequenceDiagram
    participant C as Client
    participant A as API
    participant R as ResearchEngine
    participant L as LLM
    participant S as Search
    participant J as JinaAI
    
    C->>A: POST /api/research/stream
    activate A
    A->>R: Start Research Process
    activate R
    
    R->>L: Generate Search Queries
    L-->>R: Search Queries
    
    par Parallel Search
        R->>S: Execute Query 1
        R->>S: Execute Query 2
        R->>S: Execute Query 3
        R->>S: Execute Query 4
    end
    
    S-->>R: Search Results
    
    loop For Each URL
        R->>J: Extract Content
        J-->>R: Page Content
        R->>L: Evaluate Relevance
        L-->>R: Relevance Score
        R->>L: Extract Context
        L-->>R: Relevant Context
    end
    
    R->>L: Generate Report
    L-->>R: Final Report
    
    R-->>A: Research Complete
    deactivate R
    A-->>C: Stream Updates & Report
    deactivate A
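
The parallel-search step in the diagram above maps naturally onto asyncio.gather; a simplified, hypothetical sketch of that pattern (the real engine lives in app/researcher.py and may be structured differently):

import asyncio
import os

import aiohttp

# Hypothetical sketch of the parallel-search step shown in the diagram;
# app/researcher.py may implement this differently.
async def search_serpapi(session: aiohttp.ClientSession, query: str) -> list[str]:
    params = {"q": query, "api_key": os.environ["SERPAPI_API_KEY"], "engine": "google"}
    async with session.get("https://serpapi.com/search", params=params) as resp:
        data = await resp.json()
        return [r["link"] for r in data.get("organic_results", [])]

async def run_parallel_search(queries: list[str]) -> set[str]:
    async with aiohttp.ClientSession() as session:
        results = await asyncio.gather(*(search_serpapi(session, q) for q in queries))
    # Deduplicate links across all queries
    return {link for links in results for link in links}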

Features

  • Multiple LLM Provider Support:
    • OpenRouter (default)
    • OpenAI
    • Anthropic
    • Ollama (local)
  • Intelligent Query Generation: Creates targeted search queries to explore different aspects of your topic
  • Parallel Web Search: Uses SERPAPI for efficient web searching
  • Content Processing: Uses Jina AI for webpage content extraction
  • Automated Research Process: Iteratively explores topics until sufficient information is gathered
  • Real-time Status Updates: Streams research progress using Server-Sent Events (SSE)
  • Markdown Report Generation: Saves research findings with full process logs

User Interface

The application provides a modern, intuitive interface for conducting deep research:

Research Form

The main research form allows you to:

  • Enter your research query
  • Configure research depth (number of iterations)
  • Start the research process with a single click

Research Progress

Real-time progress tracking shows:

  • Current research stage
  • Generated search queries
  • Processing status
  • Found links and their relevance
  • Iteration progress

The progress view includes:

  • A visual timeline of the research process
  • Color-coded status indicators
  • Progress bars for overall completion
  • Detailed logs of each step

Query Generation

The system generates intelligent search queries like:

  • "ARTIFICIAL INTELLIGENCE IN INSURANCE CLAIMS PROCESSING"
  • "INSURANCE COMPANY DENIAL RATE TRENDS WITH AI"
  • "IMPACT OF MACHINE LEARNING ON INSURANCE UNDERWRITING DECISIONS"
  • "EFFECTIVENESS OF NATURAL LANGUAGE PROCESSING IN INSURANCE DENIAL APPEALS"

Status Updates

The interface provides clear feedback with:

  • Success/error notifications
  • Connection status indicators
  • Progress animations
  • Detailed error messages when needed

Installation

  1. Ensure you have Python 3.10 installed:
python --version  # Should show Python 3.10.x
  2. Clone the repository:
git clone https://github.com/yourusername/OpenDeepResearcher-API.git
cd OpenDeepResearcher-API
  3. Create a Python virtual environment:
python -m venv venv-odr-310  # Specifically using Python 3.10
source venv-odr-310/bin/activate  # On Unix/macOS
# or
.\venv-odr-310\Scripts\activate  # On Windows
  4. Install dependencies:
pip install -r requirements.txt

Configuration

  1. Copy the example environment file:
cp .env.example .env
  2. Configure your environment variables in .env:
# Required API Keys
SERPAPI_API_KEY=your_serpapi_key
JINA_API_KEY=your_jina_key

# LLM Provider (choose one)
LLM_PROVIDER=openrouter  # Options: openrouter, openai, anthropic, ollama

# Provider-specific API Keys (only needed for chosen provider)
OPENROUTER_API_KEY=your_openrouter_key
OPENAI_API_KEY=your_openai_key
ANTHROPIC_API_KEY=your_anthropic_key

# Ollama Settings (only needed if using ollama)
OLLAMA_HOST=http://localhost:11434  # Default Ollama host
OLLAMA_MODEL=llama3.2  # Default model
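
The app/config.py module loads these variables; a hypothetical sketch of how that could look with pydantic-settings (field names mirror the variables above; the actual module may use a different mechanism):

from pydantic_settings import BaseSettings, SettingsConfigDict

# Hypothetical sketch of settings loading; the project's actual config.py
# may use different names or a different mechanism entirely.
class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env")

    serpapi_api_key: str
    jina_api_key: str
    llm_provider: str = "openrouter"
    openrouter_api_key: str | None = None
    openai_api_key: str | None = None
    anthropic_api_key: str | None = None
    ollama_host: str = "http://localhost:11434"
    ollama_model: str = "llama3.2"

settings = Settings()
print(settings.llm_provider)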

Usage

  1. Start the API server:
uvicorn app.main:app --reload
  2. The API will be available at http://localhost:8000

  3. API Endpoints:

    • POST /api/research: Traditional synchronous research endpoint
    • POST /api/research/stream: Stream research progress in real-time using SSE
    • GET /api/health: Check API health

Example research request (traditional):

curl -X POST http://localhost:8000/api/research \
  -H "Content-Type: application/json" \
  -d '{"query": "Impact of quantum computing on cryptography", "max_iterations": 5}'

Example streaming request (real-time updates):

curl -N -H "Accept: text/event-stream" \
  -H "Content-Type: application/json" \
  -X POST http://localhost:8000/api/research/stream \
  -d '{"query": "Impact of quantum computing on cryptography", "max_iterations": 5}'

The streaming endpoint provides real-time updates on:

  • Research initialization
  • Query generation
  • Search execution
  • Content processing
  • Context extraction
  • Report generation
  • Final results

Understanding Stream Updates

The streaming endpoint returns Server-Sent Events (SSE) with the following structure:

data: {
    "type": "<update_type>",
    "message": "<descriptive_message>",
    // Additional data specific to the update type
}
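
On the server side, a stream with this shape is typically produced with FastAPI's StreamingResponse; a minimal illustrative sketch, not the project's actual handler in app/main.py:

import json
from fastapi import FastAPI
from fastapi.responses import StreamingResponse
from pydantic import BaseModel

app = FastAPI()

class ResearchRequest(BaseModel):
    query: str
    max_iterations: int = 5

# Illustrative only: the real handler in app/main.py drives the full
# research loop and emits the update types documented below.
@app.post("/api/research/stream")
async def research_stream(request: ResearchRequest):
    async def event_stream():
        yield "data: " + json.dumps(
            {"type": "start", "message": f"Starting research: {request.query}"}
        ) + "\n\n"
        # ... intermediate progress events would be yielded here ...
        yield "data: " + json.dumps(
            {"type": "complete", "message": "Research completed successfully"}
        ) + "\n\n"
    return StreamingResponse(event_stream(), media_type="text/event-stream")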

Status update types include:

  • start: Research initialization
  • progress: General progress updates
  • queries: Generated search queries
  • iteration: Start of a new research iteration
  • links: Number of unique links found
  • processing: Currently processing URL
  • evaluation: Content relevance evaluation
  • context: Extracted relevant context
  • warning: Processing warnings or issues
  • error: Error notifications
  • complete: Final research results

Example stream output:

data: {"type": "start", "message": "Starting research: What are the benefits of meditation?"}

data: {"type": "progress", "message": "Generating initial search queries..."}

data: {
    "type": "queries",
    "message": "Generated initial queries",
    "queries": [
        "benefits of meditation for stress relief",
        "meditation and mental health",
        "how meditation improves focus and productivity",
        "meditation and its impact on anxiety"
    ]
}

data: {"type": "iteration", "message": "=== Iteration 1 ===", "iteration": 1}

data: {"type": "links", "message": "Found 30 unique links", "count": 30}

data: {
    "type": "processing",
    "message": "Processing: https://example.com/article",
    "url": "https://example.com/article"
}

data: {
    "type": "evaluation",
    "message": "Page usefulness: Yes",
    "url": "https://example.com/article",
    "useful": true
}

data: {
    "type": "context",
    "message": "Extracted context (preview): Research shows that meditation...",
    "url": "https://example.com/article"
}

data: {
    "type": "complete",
    "message": "Research completed successfully",
    "report": "Final research report...",
    "logs": ["Log entry 1", "Log entry 2"]
}

Client-side Handling

Example JavaScript code to handle the stream. The browser's built-in EventSource only issues GET requests, so this example uses fetch with a streaming reader to consume SSE from the POST endpoint:

async function streamResearch(query, maxIterations = 5) {
    const response = await fetch('/api/research/stream', {
        method: 'POST',
        headers: {
            'Content-Type': 'application/json',
            'Accept': 'text/event-stream'
        },
        body: JSON.stringify({ query, max_iterations: maxIterations })
    });

    const reader = response.body.getReader();
    const decoder = new TextDecoder();
    let buffer = '';

    while (true) {
        const { done, value } = await reader.read();
        if (done) break;
        buffer += decoder.decode(value, { stream: true });

        // SSE events are separated by blank lines; keep any partial event in the buffer
        const events = buffer.split('\n\n');
        buffer = events.pop();

        for (const event of events) {
            const line = event.trim();
            if (!line.startsWith('data:')) continue;
            const data = JSON.parse(line.slice('data:'.length));

            switch (data.type) {
                case 'start':
                    console.log('Research started:', data.message);
                    break;
                case 'queries':
                    console.log('Search queries:', data.queries);
                    break;
                case 'links':
                    console.log('Found links:', data.count);
                    break;
                case 'evaluation':
                    console.log('Content evaluation:', data.message, data.useful);
                    break;
                case 'complete':
                    console.log('Research complete:', data.report);
                    return;
                default:
                    console.log('Update:', data.message);
            }
        }
    }
}

streamResearch('What are the benefits of meditation?').catch((error) => {
    console.error('Stream error:', error);
});
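
The same stream can be consumed from Python; a minimal sketch using requests with stream=True, assuming each event's JSON payload arrives on a single data: line:

import json
import requests

# Sketch of a Python consumer for the streaming endpoint; assumes each
# event's JSON payload fits on a single "data:" line.
with requests.post(
    "http://localhost:8000/api/research/stream",
    json={"query": "What are the benefits of meditation?", "max_iterations": 3},
    headers={"Accept": "text/event-stream"},
    stream=True,
) as response:
    response.raise_for_status()
    for line in response.iter_lines(decode_unicode=True):
        if not line or not line.startswith("data:"):
            continue  # skip blank separators and any non-data lines
        update = json.loads(line[len("data:"):])
        if update["type"] == "complete":
            print(update["report"])
            break
        print(f"[{update['type']}] {update['message']}")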

LLM Provider Configuration

OpenRouter (Default)

LLM_PROVIDER=openrouter
OPENROUTER_API_KEY=your_key
OPENROUTER_MODEL=meta-llama/llama-3-8b-instruct:free

OpenAI

LLM_PROVIDER=openai
OPENAI_API_KEY=your_key
OPENAI_MODEL=o1

Anthropic

LLM_PROVIDER=anthropic
ANTHROPIC_API_KEY=your_key
ANTHROPIC_MODEL=claude-3-haiku-20240307

Ollama (Local)

LLM_PROVIDER=ollama
OLLAMA_HOST=http://localhost:11434
OLLAMA_MODEL=llama3.2
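
Internally, app/llm_providers.py selects a provider based on LLM_PROVIDER; a simplified, hypothetical sketch of that dispatch (actual names and defaults may differ):

import os

# Hypothetical sketch of the provider dispatch in app/llm_providers.py;
# the defaults mirror the configuration examples above.
def get_llm_config() -> dict:
    provider = os.getenv("LLM_PROVIDER", "openrouter")
    if provider == "openrouter":
        return {"api_key": os.environ["OPENROUTER_API_KEY"],
                "model": os.getenv("OPENROUTER_MODEL", "meta-llama/llama-3-8b-instruct:free")}
    if provider == "openai":
        return {"api_key": os.environ["OPENAI_API_KEY"],
                "model": os.getenv("OPENAI_MODEL", "o1")}
    if provider == "anthropic":
        return {"api_key": os.environ["ANTHROPIC_API_KEY"],
                "model": os.getenv("ANTHROPIC_MODEL", "claude-3-haiku-20240307")}
    if provider == "ollama":
        return {"host": os.getenv("OLLAMA_HOST", "http://localhost:11434"),
                "model": os.getenv("OLLAMA_MODEL", "llama3.2")}
    raise ValueError(f"Unsupported LLM_PROVIDER: {provider}")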

Testing with Ollama Locally

  1. Install Ollama:
# macOS or Linux
curl -fsSL https://ollama.com/install.sh | sh

# Windows
# Download from https://ollama.com/download
  2. Start the Ollama service:
ollama serve
  3. Pull your desired model (e.g., Llama 3.2):
ollama pull llama3.2
  4. Configure your .env file for Ollama:
# Required API Keys (still needed for web search and content extraction)
SERPAPI_API_KEY=your_serpapi_key
JINA_API_KEY=your_jina_key

# Set Ollama as the LLM provider
LLM_PROVIDER=ollama

# Ollama Configuration
OLLAMA_HOST=http://localhost:11434
OLLAMA_MODEL=llama3.2
  5. Start the API server:
uvicorn app.main:app --reload
  6. Test the research endpoint:
curl -X POST http://localhost:8000/api/research \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the main features of the Llama 3.2 model?", "max_iterations": 2}'

Available Ollama models:

  • llama3.2 - General purpose model
  • mistral - Powerful open-source model

To use a different model, update OLLAMA_MODEL in your .env file and ensure you've pulled the model with ollama pull model_name.
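
Before starting the server, you can sanity-check that Ollama is reachable and the model has been pulled; a small sketch against Ollama's /api/tags endpoint, which lists locally available models:

import os
import requests

# Sanity check: is the configured Ollama host reachable, and has the
# chosen model been pulled? GET /api/tags lists local models.
host = os.getenv("OLLAMA_HOST", "http://localhost:11434")
model = os.getenv("OLLAMA_MODEL", "llama3.2")

tags = requests.get(f"{host}/api/tags", timeout=5).json()
local = [m["name"] for m in tags.get("models", [])]
if any(name == model or name.startswith(f"{model}:") for name in local):
    print(f"{model} is available at {host}")
else:
    print(f"{model} not found locally; run: ollama pull {model}")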

Development

Project Structure

graph TD
    Root[OpenDeepResearcher-API] --> App[app/]
    Root --> Outputs[research_outputs/]
    Root --> Config[Configuration Files]
    Root --> Docs[Documentation]
    
    App --> Init[__init__.py<br/>Version Info]
    App --> Main[main.py<br/>FastAPI App]
    App --> Researcher[researcher.py<br/>Core Engine]
    App --> LLM[llm_providers.py<br/>LLM Integration]
    App --> Conf[config.py<br/>Settings]
    
    Config --> Env[.env<br/>Environment Variables]
    Config --> EnvExample[.env.example<br/>Template]
    Config --> Reqs[requirements.txt<br/>Dependencies]
    
    Docs --> README[README.md<br/>Documentation]
    Docs --> License[LICENSE<br/>MIT License]
    
    Outputs --> Reports[Research Reports<br/>.md Files]

Data Flow

flowchart TD
    Query[User Query] --> Engine[Research Engine]
    
    Engine --> QueryGen[Query Generation]
    QueryGen --> Search[Web Search]
    
    Search --> URLs[URL Collection]
    URLs --> Content[Content Extraction]
    
    Content --> Relevance[Relevance Check]
    Relevance --> Context[Context Extraction]
    
    Context --> Analysis[Information Analysis]
    Analysis --> Report[Report Generation]
    
    Report --> Save[Save to Markdown]
    Report --> Stream[Stream to Client]

Acknowledgments

This project is based on Matt Shumer's original OpenDeepResearcher implementation. The original work has been adapted into a REST API service with additional features like multi-provider LLM support, parallel processing, and real-time status updates.

License

This project is licensed under the MIT License - see the LICENSE file for details.

TODO and Roadmap

Planned Migrations

  1. Content Extraction - [In Progress]

    • Migrate from Jina.ai to our own extraction stack
    • See JINA_MIGRATION.md for detailed plan
    • Implement multiple extraction methods (Trafilatura, Newspaper3k, BeautifulSoup); a sketch of this fallback chain appears after this list
    • Add content caching for improved performance
  2. Web Search - [Planned]

    • Migrate from SERPAPI to direct search implementations
    • Support multiple search engines (Google, Bing, DuckDuckGo)
    • Implement rate limiting and proxy rotation
    • Add search result caching
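
A rough sketch of the planned extraction fallback chain from item 1 (illustrative only; Newspaper3k would slot in as an additional fallback, and the final design is tracked in JINA_MIGRATION.md):

import requests
import trafilatura
from bs4 import BeautifulSoup

# Illustrative fallback chain for the planned extraction stack; the final
# implementation may differ.
def extract_content(url: str) -> str:
    html = requests.get(url, timeout=30).text

    # First choice: Trafilatura's main-content extraction
    text = trafilatura.extract(html)
    if text:
        return text

    # Last resort: strip all markup with BeautifulSoup
    soup = BeautifulSoup(html, "html.parser")
    return soup.get_text(separator="\n", strip=True)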

Feature Enhancements

  1. Response Formatting

    • Add support for multiple output formats (MD, PDF, HTML)
    • Implement custom templates for research reports
    • Add citation support and bibliography generation
    • Include source credibility scoring
  2. User Interface

    • Develop a web-based dashboard
    • Add real-time research progress visualization
    • Implement interactive report editing
    • Add source material management
  3. Research Quality

    • Implement fact-checking against multiple sources
    • Add source credibility scoring
    • Improve context relevance detection
    • Enhance duplicate content detection
  4. Performance Optimizations

    • Implement distributed processing for large research tasks
    • Add result caching system
    • Optimize memory usage during large operations
    • Improve concurrent request handling

Infrastructure

  1. Monitoring and Logging

    • Add detailed performance metrics
    • Implement error tracking and reporting
    • Add system health monitoring
    • Set up automated alerts
  2. Testing

    • Expand unit test coverage
    • Add integration tests
    • Implement performance benchmarks
    • Add automated quality checks
  3. Documentation

    • Add API reference documentation
    • Create developer guides
    • Add more code examples
    • Create troubleshooting guides
  4. Deployment

    • Add Docker support
    • Create Kubernetes configurations
    • Set up CI/CD pipelines
    • Add automated deployment scripts

Security

  1. Authentication and Authorization

    • Implement user authentication
    • Add role-based access control
    • Add API key management
    • Implement rate limiting
  2. Data Protection

    • Add data encryption at rest
    • Implement secure credential storage
    • Add audit logging
    • Implement backup and recovery

Community Features

  1. Collaboration

    • Add shared research projects
    • Implement commenting system
    • Add version control for reports
    • Create research templates
  2. Integration

    • Add webhook support
    • Create plugin system
    • Add export to various platforms
    • Implement sharing features

To contribute to any of these improvements, please check our CONTRIBUTING.md guide.
