A simple video analysis chat demo where users can upload a short video and ask questions about it using AI.
We chose to process videos in the browser using native Web APIs instead of relying on FFmpeg binaries:
Why not FFmpeg?
- External binary dependency requires system-level installation
- Not portable across different OS/environments
- Complicates deployment and CI/CD pipelines
- Makes the monorepo not self-contained
Our approach:
- Use `HTMLVideoElement` for video loading and duration validation
- Use the Canvas API for frame extraction
- Send base64-encoded frames directly to the backend
- Zero external dependencies - works anywhere JavaScript runs
Trade-offs:
- Browser must support video codec (most modern browsers do)
- Processing happens client-side (acceptable for short videos up to 20s)
- Larger payload size (base64 frames vs video file)
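In code, the client-side pipeline described above might look roughly like this. It is a sketch under the stated limits (1 fps, 10-frame cap); `frameTimestamps` and `captureFrame` are illustrative names, not the actual app code:

```typescript
const FPS = 1;
const MAX_FRAMES = 10;

// Timestamps (in seconds) at which frames are grabbed: 1 fps, capped at 10.
function frameTimestamps(durationSeconds: number): number[] {
  const count = Math.min(Math.floor(durationSeconds * FPS), MAX_FRAMES);
  return Array.from({ length: count }, (_, i) => i / FPS);
}

// Browser-only: draw the video's current frame onto a canvas and
// return it as a base64 data URL, ready to send to the backend.
function captureFrame(video: { videoWidth: number; videoHeight: number }): string {
  const doc = (globalThis as any).document; // only available in a browser
  const canvas = doc.createElement("canvas");
  canvas.width = video.videoWidth;
  canvas.height = video.videoHeight;
  canvas.getContext("2d").drawImage(video, 0, 0);
  return canvas.toDataURL("image/jpeg", 0.8);
}
```

Seeking to each timestamp, waiting for the `seeked` event, then calling `captureFrame` yields the base64 frames the backend receives.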
Videos are limited to 20 seconds maximum. This is enforced at two levels:
Frontend validation:
- `HTMLVideoElement.duration` is checked before processing
- Videos over 20 seconds are rejected immediately
Backend validation:
- Frames are extracted at 1 frame per second, capped at 10 frames
- Backend rejects requests with more than 10 frames
- This provides a secondary check: 10 frames × 1fps = 10 seconds of coverage
- Even if frontend is bypassed, backend limits the data processed
This dual validation ensures the limit is respected regardless of how the API is called.
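The two checks could be sketched as small pure predicates; the constants mirror the limits described above, and the function names are illustrative:

```typescript
const MAX_DURATION_S = 20; // frontend limit on HTMLVideoElement.duration
const MAX_FRAMES = 10;     // backend limit on frames per request

// Frontend: reject before any frames are extracted.
function isDurationAllowed(durationSeconds: number): boolean {
  return (
    Number.isFinite(durationSeconds) &&
    durationSeconds > 0 &&
    durationSeconds <= MAX_DURATION_S
  );
}

// Backend: reject oversized requests even if the frontend was bypassed.
function isFrameCountAllowed(frameCount: number): boolean {
  return Number.isInteger(frameCount) && frameCount > 0 && frameCount <= MAX_FRAMES;
}
```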
The chat maintains full conversation history for contextual follow-up questions:
- Backend stores chat history per session (in-memory)
- Each request to OpenAI includes previous messages
- Users can ask follow-up questions like "What color was that?" or "Tell me more about the person"
- Context is preserved until the session ends (page refresh)
This enables natural multi-turn conversations about the video content.
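A minimal sketch of the in-memory, per-session history described above (names are assumptions, not the actual backend code):

```typescript
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

// In-memory store: history lives only as long as the process/session.
const sessions = new Map<string, ChatMessage[]>();

// Append a message and return the full history — the full array is what
// gets sent to OpenAI on each request, so follow-ups keep their context.
function appendMessage(sessionId: string, message: ChatMessage): ChatMessage[] {
  const history = sessions.get(sessionId) ?? [];
  history.push(message);
  sessions.set(sessionId, history);
  return history;
}
```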
```
Frontend (React + Tailwind)            Backend (Fastify)
        │                                     │
        │ 1. Load video                       │
        │ 2. Validate duration                │
        │ 3. Extract frames (canvas)          │
        │                                     │
        │──── POST /api/session ─────────────>│ 4. Store frames in memory
        │<─── sessionId ──────────────────────│
        │                                     │
        │──── POST /api/chat ────────────────>│ 5. Send frames to OpenAI
        │<─── AI response ────────────────────│ 6. Return analysis
```
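The two endpoints in the flow above might exchange payloads shaped like the following. This is a sketch only — the field names are assumptions, not the actual API contract:

```typescript
// Assumed payload shapes for the two endpoints (illustrative names).
interface CreateSessionRequest {
  frames: string[]; // base64-encoded frames extracted in the browser
}
interface CreateSessionResponse {
  sessionId: string;
}
interface ChatRequest {
  sessionId: string;
  message: string;
}
interface ChatResponse {
  reply: string; // AI analysis grounded in the stored frames
}

// How the frontend might assemble the session payload after extraction.
function buildSessionRequest(frames: string[]): CreateSessionRequest {
  return { frames };
}
```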
- Bun v1.0+
- OpenAI API key
1. Clone and install:

   ```sh
   git clone https://github.com/inewlegend/video-processing-ai
   cd video-processing-ai
   bun install
   ```

2. Configure environment:

   ```sh
   cp .env-sample .env
   # Edit .env and add your OPENAI_API_KEY
   ```

3. Start both servers:

   ```sh
   bun run backend:dev   # Terminal 1 - runs on :3000
   bun run frontend:dev  # Terminal 2 - runs on :3020
   ```
- Upload a video (max 20 seconds)
- Wait for processing
- Ask questions about the video in the chat
```
video-processing-ai/
├── apps/
│   ├── frontend/          # React + Tailwind + shadcn
│   │   ├── src/
│   │   │   ├── components/  # UI components
│   │   │   ├── services/    # API and video processing
│   │   │   └── app.tsx      # Main app
│   │   └── index.html
│   └── backend/           # Fastify API
│       └── src/
│           ├── routes/      # API endpoints
│           ├── services/    # OpenAI integration
│           └── types/       # TypeScript types
├── .env-sample            # Environment template
└── package.json           # Monorepo scripts
```
| Variable | Description | Default |
|---|---|---|
| `OPENAI_API_KEY` | OpenAI API key | (required) |
| `BACKEND_HOST` | Backend host | `localhost` |
| `BACKEND_PORT` | Backend port | `3000` |
| `FRONTEND_HOST` | Frontend host | `localhost` |
| `FRONTEND_PORT` | Frontend port | `3020` |