Financial Document Analyzer

A multi-agent system for analyzing financial documents (PDFs) using CrewAI and GPT-4. Upload a quarterly report or financial statement, and get back detailed analysis including verification, metrics extraction, investment recommendations, and risk assessment.

Screenshots

Web UI

Analysis Results

Swagger API Documentation

What it does

Reads PDF financial documents (10-K, quarterly reports, etc.)
Runs 4 AI agents that each specialize in different analysis
Stores results in PostgreSQL
Has a simple web UI + REST API

Setup

Clone and add your API keys to .env:

cp .env.example .env
# edit .env with your OPENAI_API_KEY and SERPER_API_KEY

Run with Docker:

docker-compose up --build

Open http://localhost:8000 and upload a PDF

That's it. The database tables get created automatically on first run.

How to use

Web UI

Go to http://localhost:8000, upload PDF, optionally add a specific question, click analyze.

API

curl -X POST http://localhost:8000/analyze \
  -F "file=@quarterly_report.pdf" \
  -F "query=What are the key financial risks?"

Response includes:

verification - document authenticity check
financial_analysis - key metrics and trends
investment_recommendations - buy/hold/sell thesis
risk_assessment - identified risks and mitigations

Check API health

curl http://localhost:8000/health

Architecture

4 CrewAI agents working sequentially:

Verifier - checks document is legit and complete
Financial Analyst - extracts metrics, analyzes trends
Investment Advisor - gives investment recommendation
Risk Assessor - identifies financial/operational risks

Results get saved to PostgreSQL (analysis and analysis_results tables).

Background jobs go through Celery + Redis.

Stack

Python 3.11
FastAPI
CrewAI 0.130.0
OpenAI GPT-4 Turbo
PostgreSQL 15
Redis (for Celery)
Docker

Files

main.py          - FastAPI app, API endpoints
agents.py        - CrewAI agent definitions  
task.py          - Task definitions for each agent
tools.py         - PDF reader tool
db_models.py     - SQLAlchemy models
celery_app.py    - Celery configuration
worker.py        - Celery worker
index.html       - Web UI

Bugs I fixed

The original code had several issues that I identified and fixed:

Import errors - CrewAI 0.130+ changed import paths (from crewai import Agent not from crewai.agents)
Tool pattern wrong - Had to use BaseTool from crewai.tools with _run() method
Agents couldn't find files - Fixed by pre-reading PDF content server-side and injecting into prompts
SQLAlchemy issues - JSONB import path changed, metadata is reserved keyword (renamed to result_metadata)
Missing validation - Added PDF header checks, file size limits, content-type verification
No database persistence - Added PostgreSQL storage for analysis results
Generic error handling - Added proper HTTP status codes (400, 413, 415, 500)
Circular LLM assignment - Original had llm=llm, fixed with proper ChatOpenAI initialization
Wrong parameter name - tool= changed to tools= (list of tools)
max_iter=1 - Agents gave up after 1 try, increased to reasonable values
Missing agent goal parameter - Verifier and Risk Assessor had syntax errors
PDFReader import missing - Added from llama_index.readers.file import PDFReader
Unprofessional backstories - Rewrote agent backstories to be more professional
Incomplete task descriptions - Rewrote task expected outputs with clear structure

Notes

PDF content is truncated to ~15KB before sending to agents (keeps costs down)
Each analysis takes 30-60 seconds depending on document size
Flower dashboard at http://localhost:5555 for monitoring Celery tasks

Environment variables

See .env.example for all options. Main ones:

OPENAI_API_KEY - your OpenAI key
SERPER_API_KEY - for web search (optional)
DATABASE_URL - PostgreSQL connection string

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Financial Document Analyzer

Screenshots

Web UI

Analysis Results

Swagger API Documentation

What it does

Setup

How to use

Web UI

API

Check API health

Architecture

Stack

Files

Bugs I fixed

Notes

Environment variables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Screenshots		Screenshots
data		data
outputs		outputs
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
agents.py		agents.py
celery_app.py		celery_app.py
db_models.py		db_models.py
docker-compose.yml		docker-compose.yml
index.html		index.html
main.py		main.py
requirements.txt		requirements.txt
task.py		task.py
tasks.py		tasks.py
tools.py		tools.py
worker.py		worker.py

Folders and files

Latest commit

History

Repository files navigation

Financial Document Analyzer

Screenshots

Web UI

Analysis Results

Swagger API Documentation

What it does

Setup

How to use

Web UI

API

Check API health

Architecture

Stack

Files

Bugs I fixed

Notes

Environment variables

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages