An intelligent multi-agent research platform that performs automated research, claim extraction, contradiction detection, verification, and confidence scoring using a hybrid architecture combining LLM agents, semantic retrieval, and local ML models.
This project is an AI-powered research assistant system that:
- Breaks complex questions into research tasks
- Searches and analyzes sources
- Extracts structured factual claims
- Detects contradictions across sources
- Verifies evidence using hybrid ML + LLM reasoning
- Assigns confidence scores
- Generates explainable research reports
The system combines:
- Multi-agent orchestration
- Retrieval-augmented reasoning
- Semantic vector search
- Transformer-based stance classification
- Structured knowledge storage
Specialized agents collaborate to solve research tasks:
-
Planner Agent
- Breaks queries into sub-questions
-
Research Agent
- Retrieves and summarizes evidence
-
Claim Extraction Agent
- Converts text into structured factual claims
-
Verification Agent
- Evaluates evidence and claim validity
-
Writer Agent
- Synthesizes structured final reports
Claims are stored in ChromaDB with embeddings for:
- Semantic retrieval
- Topic clustering
- Contradiction analysis
- Persistent research memory
The system identifies:
- Supporting claims
- Opposing claims
- Uncertain claims
using:
- Embedding similarity
- Transformer-based NLI models
- LLM reasoning
The platform combines:
- Local transformer inference
- Embedding similarity scoring
- LLM-based nuanced reasoning
instead of relying entirely on LLM prompts.
The system exposes:
- Agent execution traces
- Evidence sources
- Confidence reasoning
- Contradiction analysis
to improve transparency and interpretability.
Frontend (React / Next.js)
β
FastAPI Backend
β
Agent Orchestrator
β
Agents:
βββ Planner Agent
βββ Research Agent
βββ Claim Extraction Agent
βββ Verification Agent
βββ Writer Agent
β
ML Intelligence Layer
βββ Embedding Similarity
βββ Stance Classification
βββ Confidence Scoring
β
ChromaDB Vector Store
β
External Search + Scraping
- Python
- FastAPI
- Google Agent Development Kit (ADK)
- React / Next.js
- Gemini API
- Hugging Face Transformers
- PyTorch
- Sentence Transformers
- ChromaDB
- Tavily API
- BeautifulSoup
- newspaper3k
- Docker
- Docker Compose
- Nginx
- AWS EC2
- GitHub Actions CI/CD
The Planner Agent decomposes the user query into focused research tasks.
Example:
{
"tasks": [
{
"id": 1,
"question": "What are the benefits of AI in software engineering?"
},
{
"id": 2,
"question": "What risks does AI introduce to software engineering jobs?"
}
]
}The Research Agent:
- searches the web
- retrieves sources
- extracts relevant evidence
The Claim Extraction Agent converts evidence into structured claims.
Example:
{
"claim": "AI automation is reducing demand for junior software engineers.",
"source_url": "...",
"claim_type": "prediction"
}Claims are embedded and stored in ChromaDB for:
- retrieval
- similarity search
- contradiction analysis
The system evaluates:
- supporting evidence
- contradicting evidence
- semantic consistency
- stance relationships
using:
- embedding similarity
- NLI transformers
- LLM reasoning
The Writer Agent generates:
- executive summaries
- key findings
- contradiction reports
- confidence explanations
Used for:
- semantic retrieval
- claim clustering
- contradiction candidate filtering
Model:
all-MiniLM-L6-v2
Used to determine whether evidence:
- supports
- opposes
- is neutral toward claims
Implemented using:
DeBERTa / MNLI-based transformer models
The system includes:
- Claim deduplication
- Topic clustering
- Batched embeddings
- Async research execution
- Multi-layer caching
- Similarity filtering before reasoning
- Reduced token usage via evidence snippets
The backend is deployed using:
- Docker
- Docker Compose
- Nginx reverse proxy
- AWS EC2
CI/CD is automated via GitHub Actions.
Deployment flow:
Push to GitHub
β
GitHub Actions
β
SSH into EC2
β
Docker rebuild + restart
β
Automatic deployment
Example:
GEMINI_API_KEY=
TAVILY_API_KEY=
CHROMA_DB_PATH=cd backend
pip install -r requirements.txt
uvicorn main:app --reloadcd frontend
npm install
npm run devdocker compose up --buildWill AI replace software engineers?
The system:
- breaks the question into sub-questions
- gathers evidence
- extracts claims
- retrieves semantically related claims
- detects contradictions
- classifies stance relationships
- assigns confidence scores
- generates a structured research report
Potential future features:
- Debate agents
- Persistent long-term memory
- Live collaborative research
- Streaming reasoning traces
- User-specific research profiles
- Autonomous follow-up questioning
MIT License
Built as an experimental AI research and reasoning platform exploring:
- agentic systems
- retrieval-augmented generation
- structured AI reasoning
- hybrid ML + LLM architectures