Transform ideas into stunning video presentations with AI-driven generation and professional rendering.
- AI Generation — Multi-agent pipeline transforms prompts into complete presentations
- Intelligent Editing — Natural language editing at global, slide, and element levels
- Video Export — Distributed render engine with Remotion
- Theme Engine — 10+ curated visual themes with AI-aware styling
- File Context — Upload documents (PDF, MD, CSV) for AI-informed generation
- Real-time Preview — Interactive canvas with drag-and-drop editing
| Layer | Technologies |
|---|---|
| Frontend | Next.js 15, React 19, TypeScript, Framer Motion |
| AI | Google Gemini 2.0, Vertex AI (Imagen 3) |
| Video | Remotion 4, WebCodecs API |
| Infrastructure | RabbitMQ, Redis, PostgreSQL, MinIO (S3) |
| DevOps | Docker, Docker Compose |
Natural language editing at three granularity levels:
```mermaid
flowchart LR
    subgraph "Edit Levels"
        L1["Global<br/>All slides"]
        L2["Slide<br/>Single slide"]
        L3["Element<br/>Selected items"]
    end
    L1 --> Classifier
    L2 --> Classifier
    L3 --> Classifier
    Classifier[AI Classifier] --> Patches[JSON Patches]
    Patches --> Apply[Deep Merge]
    Apply --> Result[Updated Slides]
```
Examples:
- "Make all headlines use brand color #6366f1"
- "Convert this slide to a two-column comparison"
- "Make the selected text larger and bold"
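The patch-then-merge step can be sketched in TypeScript. This is a minimal illustration of applying an AI-produced JSON patch to a slide via deep merge; the `Json` shape and patch format are assumptions for the example, not the project's actual types:

```typescript
type Json = { [key: string]: any };

// Recursively merge a patch into a target: nested objects merge,
// primitives and arrays replace wholesale.
function deepMerge(target: Json, patch: Json): Json {
  const result: Json = { ...target };
  for (const key of Object.keys(patch)) {
    const value = patch[key];
    if (value && typeof value === "object" && !Array.isArray(value)) {
      result[key] = deepMerge(result[key] ?? {}, value);
    } else {
      result[key] = value;
    }
  }
  return result;
}

// Example: a global edit that recolors a headline but keeps its size.
const slide = { title: { text: "Q3 Results", style: { color: "#000000", size: 32 } } };
const patch = { title: { style: { color: "#6366f1" } } };
const updated = deepMerge(slide, patch);
// updated.title.style is now { color: "#6366f1", size: 32 }
```

Because untouched keys survive the merge, the AI only has to emit the fields it wants to change.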
```mermaid
graph TB
    subgraph Frontend
        UI[Next.js App]
    end
    subgraph Workers
        AI[AI Worker]
        Render[Render Worker]
    end
    subgraph Infrastructure
        RMQ[(RabbitMQ)]
        Redis[(Redis)]
        PG[(PostgreSQL)]
        S3[(MinIO)]
    end
    subgraph External
        Gemini[Google Gemini]
    end
    UI --> RMQ
    RMQ --> AI
    RMQ --> Render
    AI --> Gemini
    AI --> S3
    Render --> S3
```
The system uses a distributed, event-driven architecture for scalable async processing:
| Component | Technology | Purpose |
|---|---|---|
| Message Queue | RabbitMQ | Decouples API from workers, enables horizontal scaling |
| Job Status | Redis | Real-time status polling, pub/sub for cancellation |
| Workers | Node.js | Stateless consumers that process AI/render jobs |
| Storage | MinIO (S3) | Object storage for assets and rendered videos |
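The job statuses Redis tracks form a small lifecycle. A minimal TypeScript sketch of the transition rules, inferred from the flow above rather than taken from the codebase (the `cancelled` state reflects the pub/sub cancellation channel):

```typescript
type JobStatus = "queued" | "processing" | "completed" | "failed" | "cancelled";

// Allowed transitions: terminal states have no outgoing edges.
const transitions: Record<JobStatus, JobStatus[]> = {
  queued: ["processing", "cancelled"],
  processing: ["completed", "failed", "cancelled"],
  completed: [],
  failed: [],
  cancelled: [],
};

export function canTransition(from: JobStatus, to: JobStatus): boolean {
  return transitions[from].includes(to);
}
```

Guarding writes to Redis with a check like this keeps a late worker update from resurrecting a job the client already cancelled.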
```mermaid
sequenceDiagram
    participant Client
    participant API
    participant RabbitMQ
    participant Worker
    participant Redis
    Client->>API: POST /api/ai/generate
    API->>RabbitMQ: Publish job
    API->>Redis: Set status: queued
    API-->>Client: { jobId }
    RabbitMQ->>Worker: Consume job
    Worker->>Redis: Update: processing
    Worker->>Worker: Execute AI pipeline
    Worker->>Redis: Update: completed
    Client->>API: GET /api/ai/status/{jobId}
    API->>Redis: Get status
    API-->>Client: { status, result }
```
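The client side of this sequence is a polling loop. A hedged sketch, assuming a status endpoint that returns `{ status, result }`; the fetcher is injected so the loop is independent of any HTTP library:

```typescript
type StatusResponse = {
  status: "queued" | "processing" | "completed" | "failed";
  result?: unknown;
};

// Poll until the job reaches a terminal state or the attempt budget runs out.
export async function pollJob(
  jobId: string,
  fetchStatus: (jobId: string) => Promise<StatusResponse>,
  intervalMs = 1000,
  maxAttempts = 60,
): Promise<StatusResponse> {
  for (let i = 0; i < maxAttempts; i++) {
    const res = await fetchStatus(jobId);
    if (res.status === "completed" || res.status === "failed") return res;
    await new Promise((r) => setTimeout(r, intervalMs));
  }
  throw new Error(`Job ${jobId} timed out`);
}

// In the browser the fetcher might be:
// (id) => fetch(`/api/ai/status/${id}`).then((r) => r.json());
```

Injecting the fetcher also makes the loop trivial to unit-test with a fake that flips to `completed` after a few calls.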
```mermaid
flowchart LR
    Query[User Query] --> Summarizer
    Summarizer --> Director
    Director --> Assets[Asset Generator]
    Director --> Slides[Slide Generator]
    Assets --> Integration
    Slides --> Integration
    Integration --> Validator
    Validator --> Output[Final Slides]
```
| Agent | Model | Purpose |
|---|---|---|
| Summarizer | Gemini Flash Lite | Extract intent and key points |
| Director | Gemini Flash | Plan narrative and visual style |
| Slide Generator | Gemini Flash | Generate slide JSON |
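The pipeline wiring can be sketched as plain async composition, with the asset and slide branches fanning out in parallel after the Director. The agents here are stubs standing in for the Gemini-backed ones above; names and shapes are illustrative, not the project's:

```typescript
type Agent<I, O> = (input: I) => Promise<O>;
type Plan = { narrative: string; style: string };

// Stub agents; the real ones call Gemini models.
const summarizer: Agent<string, string> = async (q) => `summary of: ${q}`;
const director: Agent<string, Plan> = async (s) => ({ narrative: s, style: "minimal" });
const assetGen: Agent<Plan, string[]> = async () => ["hero.png"];
const slideGen: Agent<Plan, { title: string }[]> = async (p) => [{ title: p.narrative }];

export async function runPipeline(query: string) {
  const summary = await summarizer(query);
  const plan = await director(summary);
  // Assets and slides are independent, so they run concurrently.
  const [assets, slides] = await Promise.all([assetGen(plan), slideGen(plan)]);
  return { slides, assets }; // integration + validation would follow here
}
```

Running the two generators under `Promise.all` roughly halves the wall-clock time of the slowest stage, which matters when each branch is a model round-trip.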
```
├── app/                  # Next.js App Router
│   ├── api/ai/           # AI endpoints
│   ├── api/render/       # Render endpoints
│   └── generate/         # Dashboard & editor
├── worker/
│   ├── ai/               # AI generation worker
│   └── render/           # Video render worker
├── remotion/             # Video composition
└── docker-compose.yml    # Full stack setup
```
```bash
# Install dependencies
npm install

# Start infrastructure
docker-compose up -d

# Set up environment
cp .env.example .env.local

# Set up Google Cloud credentials (required for Vertex AI):
# place your service-account.json in the root directory,
# OR set GOOGLE_APPLICATION_CREDENTIALS in .env to your local path

# Run the development server
npm run dev

# Run workers (separate terminals)
npm run worker:ai
npm run worker:render
```

**Built with Next.js, Remotion, and Google Gemini**

