CognivAgent

Transform videos into searchable knowledge graphs

Built with Claude Agent SDK and OpenAI gpt-4o-transcribe

Quick Start | Features | Screenshots | Documentation | Join the Crew

Important Disclaimers

YouTube Content Policy

This project is NOT designed for downloading videos or audio from YouTube.

CognivAgent extracts transcripts only for analysis purposes. All video/audio content is automatically deleted during runtime after transcription completes.

Contributions that change this behavior will NOT be accepted.

Security Notice

This code is NOT ready for production deployment.

The application requires serious security analysis before being deployed beyond localhost.

Strong recommendation: Only run on localhost for development and research.

Quick Start

One-Line Install

curl -fsSL https://raw.githubusercontent.com/costiash/CognivAgent/main/install.sh | bash

Security Note: This pipes a remote script to your shell. We recommend reviewing install.sh first, or using the Manual Install below.

Manual Install

git clone https://github.com/costiash/CognivAgent.git
cd CognivAgent
uv sync

cp .env.example .env
# Add your API keys to .env:
#   ANTHROPIC_API_KEY=sk-ant-...
#   OPENAI_API_KEY=sk-...

uv run python -m app.main
# Open http://127.0.0.1:8000

Docker

docker-compose up -d
# Open http://localhost:8000

See the Docker Guide for detailed deployment options.

Features

Feature	Description
Video Transcription	Local videos and YouTube URLs via gpt-4o-transcribe with domain vocabulary prompts
Knowledge Graphs	Auto-bootstrap domain schemas, extract entities/relationships with source citations
Entity Resolution	Detect and merge duplicates using multi-signal similarity matching (Jaro-Winkler, alias overlap)
Chat Interface	Real-time activity streaming, Markdown rendering, dark/light themes
Graph Visualization	Interactive Cytoscape.js with search, type filtering, node inspector
Transcript Library	Save, search, export (TXT/JSON), and full-text viewer with highlighting
Audit Trail	Security blocking, tool usage logging, session lifecycle tracking
Background Jobs	Async queue with persistence, restart recovery, cancel/retry, step progress UI

Screenshots

Chat Interface

Real-time conversation with the AI agent, featuring activity streaming and Markdown rendering.

Dark Mode	Light Mode

Knowledge Graph Visualization

Interactive graph powered by Cytoscape.js with search, type filtering, and node inspector.

Dark Mode	Light Mode

Transcript Library

Save, search, and export transcripts with full-text viewer and highlighting.

Dark Mode	Light Mode

Demo Video

demo_video.mp4

5-minute full workflow: building a knowledge graph from multiple videos and querying it with agent skills

Knowledge Graph Workflow

Create Project  -->  Bootstrap  -->  Confirm Discoveries  -->  Extract  -->  Export

Create - Name your research topic
Bootstrap - First video auto-infers entity types and relationships
Confirm - Review and approve/reject discovered patterns
Extract - Subsequent videos use the confirmed schema
Export - Download as GraphML (Gephi, Neo4j, yEd) or JSON

Architecture

%%{init: {'theme': 'base', 'themeVariables': { 'primaryColor': '#818cf8', 'primaryTextColor': '#1e1b4b', 'primaryBorderColor': '#6366f1', 'lineColor': '#94a3b8', 'secondaryColor': '#f8fafc', 'tertiaryColor': '#f1f5f9'}}}%%
graph TB
    subgraph Frontend["🖥️ Frontend"]
        UI[Web UI<br/>37 ES Modules]
    end

    subgraph API["⚡ FastAPI Layer"]
        R1[Chat Router]
        R2[KG Router]
        R3[Jobs Router]
        R4[Transcripts Router]
        R5[Audit Router]
    end

    subgraph Services["🔧 Services"]
        SS[SessionService]
        KS[KnowledgeGraphService]
        JS[JobQueueService]
        AS[AuditService]
    end

    subgraph Core["🧠 Core"]
        SA[SessionActor]
        AGENT[Claude Agent<br/>+ MCP Tools]
    end

    subgraph External["🌐 External"]
        CLAUDE[Claude API]
        OPENAI[OpenAI API<br/>gpt-4o-transcribe]
    end

    UI --> R1
    UI --> R2
    UI --> R3
    UI --> R4
    UI --> R5
    R1 --> SS
    R2 --> KS
    R3 --> JS
    R5 --> AS
    SS --> SA
    SA --> AGENT
    AGENT --> CLAUDE
    AGENT --> OPENAI

    classDef frontend fill:#dbeafe,stroke:#3b82f6,stroke-width:2px,color:#1e40af
    classDef api fill:#dcfce7,stroke:#22c55e,stroke-width:2px,color:#166534
    classDef services fill:#f3e8ff,stroke:#a855f7,stroke-width:2px,color:#6b21a8
    classDef core fill:#ffedd5,stroke:#f97316,stroke-width:2px,color:#9a3412
    classDef external fill:#fee2e2,stroke:#ef4444,stroke-width:2px,color:#991b1b

    class UI frontend
    class R1,R2,R3,R4,R5 api
    class SS,KS,JS,AS services
    class SA,AGENT core
    class CLAUDE,OPENAI external

SessionActor Pattern - Queue-based actor model prevents Claude SDK cancel scope errors:

HTTP Request --> input_queue --> [SessionActor] --> response_queue --> Response

Tech Stack

Layer	Technologies
Backend	FastAPI, Uvicorn, Pydantic
AI	Claude Agent SDK, OpenAI gpt-4o-transcribe
Knowledge Graph	NetworkX, Cytoscape.js
Media	FFmpeg, pydub, yt-dlp
Frontend	ES Modules, Tailwind CSS, Marked.js, DOMPurify
Quality	mypy (strict), ruff, pytest (910 tests)

Project Structure

app/
├── api/routers/     # 9 endpoint routers
├── services/        # Session, Storage, KG, JobQueue, Audit services
├── core/            # SessionActor, config, cost tracking, audit hooks
├── agent/           # MCP tools + system prompts
├── kg/              # Domain models, graph storage, extraction
├── models/          # Pydantic schemas
├── static/js/       # 37 ES modules (chat, kg, jobs, upload, workspace)
└── templates/       # Jinja2 HTML

tests/               # 910 tests across 39 modules
data/                # Runtime storage

Development

uv run python -m app.main              # Dev server at http://127.0.0.1:8000
uv run pytest                          # Run all 910 tests
uv run mypy .                          # Type check (strict mode)
uv run ruff check . && ruff format .   # Lint + format

Configuration

Required:

ANTHROPIC_API_KEY=sk-ant-...    # Claude Agent SDK
OPENAI_API_KEY=sk-...           # gpt-4o-transcribe

Optional (for Claude Code development skills):

GEMINI_API_KEY=...              # Gemini 3 Flash skill

See CLAUDE.md for all configuration options.

Documentation

Guide	Description
Knowledge Graph	Entity extraction, graph visualization, domain bootstrapping
SDK Agent	Claude Agent SDK integration, MCP tools, hooks system
Frontend Architecture	37 ES modules, state management, UI patterns
Docker Deployment	Container setup, health checks, production
API Reference	All 9 routers, endpoints, request/response examples
Extending CognivAgent	Add tools, routers, modules, KG skills
CLAUDE.md	Development guidelines for AI assistants

🤖 Join the Crew

The Lonely Agent Initiative

The Agent has been grinding solo on rain-soaked rooftops, turning chaos into structure.
Every contribution makes the skyline a little less empty.

CognivAgent is actively seeking contributors! This is a community-driven project at the upgraded MVP stage. Whether you're fixing a typo or architecting a new feature, you're welcome on the rooftop.

The Mission Board

Difficulty	Tag	Description
⭐	`rooftop-welcome`	First-timer friendly. Quick wins to get you started.
🔧	`agent-needs-backup`	Agent needs backup on these.
🏗️	`skyline-feature`	New capabilities. Bigger scope, bigger impact.
🐛	`bug-in-the-rain`	Squash these. The Agent will appreciate it.
📝	`docs-update`	Documentation improvements. Every word counts.

Crew Ranks

Every contributor earns their place on the skyline:

Rank	Badge	How to Earn
Rooftop Visitor	🌧️	First PR merged
Rain Buddy	☔	3+ contributions
Skyline Regular	🏙️	Major feature or 10+ PRs
Agent's Partner	🤝	Core maintainer

Quick First Issues

Issue	Difficulty	Area
Add "copy to clipboard" button in transcript viewer	⭐	Frontend
Add keyboard shortcut for theme toggle (Ctrl/Cmd+D)	⭐	Frontend
Show transcript language in library list	⭐	Full Stack
Add transcript duration display in library	⭐	Full Stack

# Ready to join?
git clone https://github.com/costiash/CognivAgent.git
cd CognivAgent
uv sync
uv run pytest  # Make sure everything works

See CONTRIBUTING.md for the full briefing.

Roadmap

Every item below is an invitation to contribute. Whether you're looking for a quick win or a deep dive, pick something that interests you and open a PR!

Status	Meaning
Ready	Well-defined, ready to implement
Needs Design	Requires research or architecture decisions first
Future	Longer-term vision, open for discussion

Search & Discovery

Feature	Status	Description
Full-text transcript search	Ready	Backend indexing with SQLite FTS5 + frontend search UI
Semantic search with embeddings	Needs Design	Vector similarity for concept-based queries across transcripts
Cross-transcript entity search	Ready	Find all mentions of an entity across multiple videos

Export & Formats

Feature	Status	Description
Time-aligned SRT/VTT export	Ready	Generate subtitles with word-level timestamps from transcription
Speaker diarization	Needs Design	Identify and label different speakers in transcripts
Neo4j import scripts	Future	Generate Cypher queries for direct Neo4j import

Processing & Scale

Feature	Status	Description
Batch video processing	Ready	Queue multiple videos for sequential transcription
Playlist/channel import	Needs Design	Import entire YouTube playlists or channels
Parallel transcription	Future	Process multiple videos concurrently

AI & Analysis

Feature	Status	Description
Video frame analysis	Needs Design	Extract visual context (slides, diagrams) to enrich knowledge graphs
Context window optimization	Needs Design	Improve agent context efficiency for longer conversations
Evidence/provenance linking	Ready	Link KG nodes to source transcript timestamps
Multi-model extraction	Future	Use specialized models for different entity types

UI/UX Improvements

Feature	Status	Description
Graph panel enhancements	Ready	Better layout algorithms, node clustering, improved zoom/pan
Entity relationship explorer	Ready	Drill-down view for entity connections and paths
Mobile responsive design	Future	Full mobile experience for tablet/phone
Collaborative workspaces	Future	Share KG projects with team members

Developer Experience

Feature	Status	Description
Plugin architecture	Needs Design	Custom extractors and analyzers as plugins
REST API authentication	Ready	API keys and OAuth for programmatic access
Webhook notifications	Ready	Notify external services on job completion

License

Apache 2.0 - See LICENSE

Built with Claude Agent SDK and OpenAI gpt-4o-transcribe

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.claude		.claude
.github		.github
ai_docs		ai_docs
app		app
assets		assets
data		data
guides		guides
scripts		scripts
tests		tests
uploads		uploads
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
FRONTEND.md		FRONTEND.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
install.sh		install.sh
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

costiash/CognivAgent

Folders and files

Latest commit

History

Repository files navigation

CognivAgent

Important Disclaimers

YouTube Content Policy

Security Notice

Quick Start

One-Line Install

Manual Install

Docker

Features

Screenshots

Chat Interface

Knowledge Graph Visualization

Transcript Library

Demo Video

Knowledge Graph Workflow

Architecture

Tech Stack

Project Structure

Development

Configuration

Documentation

🤖 Join the Crew

The Mission Board

Crew Ranks

Quick First Issues

Roadmap

Search & Discovery

Export & Formats

Processing & Scale

AI & Analysis

UI/UX Improvements

Developer Experience

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages