ReelCraft

Automatically transform articles into engaging short-form videos (Reels/TikToks) using AI.

ReelCraft is an automated video generation pipeline that converts web articles into 30-60 second social media videos. It uses Google's Gemini AI for script generation and text-to-speech, Pexels for stock media, and FFmpeg for video editing. Now with a modern web interface for easy video generation!

Features

Core Features

Automatic Script Generation: Converts articles into engaging, fast-paced scripts optimized for short-form video
AI-Powered Voice Over: Generates natural-sounding voice narration using Gemini TTS
Smart Asset Selection: Automatically finds and downloads relevant images/videos from Pexels
Professional Video Editing: Combines visual assets, voice-over, and background music into polished videos
Parallel Processing: Efficiently generates audio and downloads assets concurrently
Database Integration: SQLite database for tracking videos, jobs, and metadata
Cloud Storage Support: Optional Cloudflare R2 integration for scalable video storage
Job Management: Background job tracking with status updates and cancellation
Langfuse Integration: Track and monitor AI model calls and performance

✨ New Visual Polish Features

Dynamic Audio Ducking: Background music automatically lowers during voiceover and rises during pauses (professional sidechain compression)
Text-Only Scenes: AI generates punchy text overlays (1-5 words) on solid backgrounds to emphasize key points
Smart Aspect Ratios: Landscape videos (16:9) display with blurred background fill for portrait frames (9:16)
Smooth Transitions: Dynamic scene transitions (fade, wipe, slide) instead of hard cuts for polished flow

Web Interface Features

🌐 Modern Web UI: User-friendly interface for generating videos without code
⚡ Real-time Progress: WebSocket-powered live updates during video generation
🎬 Video Gallery: Browse, preview, and download all generated videos
🗑️ Video Management: Delete videos from both local and cloud storage
📊 Job Tracking: Monitor and cancel background video generation jobs
📱 Responsive Design: Works seamlessly on desktop and mobile devices
🔌 REST API: Full-featured API with interactive documentation
🎯 One-Click Generation: Simple URL input to video output workflow

How It Works

Article URL -> Script Generation -> Audio Generation -> Asset Download -> Video Editing -> Final Video

Content Extraction: Fetches article content using FireCrawl
Script Generation: Gemini AI converts the article into 7-15 scene scripts with visual keywords
Audio Generation: Parallel generation of voice-over for each scene, with duration measurement
Audio-Driven Pacing: Each scene's duration is set by its voiceover length (ensures natural speech)
Asset Download: Concurrent download of images/videos from Pexels based on keywords
Video Composition: FFmpeg stitches visual assets to match audio timing, adds background music and effects

Web Interface

ReelCraft now includes a modern, responsive web interface that makes video generation accessible to everyone!

Features

🎨 Beautiful Dark Theme UI: Modern, eye-friendly interface with smooth animations
⚡ Real-time Updates: Live progress tracking via WebSocket connection
🎬 Video Gallery: Browse all your generated videos with preview thumbnails
📥 Easy Downloads: One-click download for any generated video
📱 Mobile Responsive: Works perfectly on desktop, tablet, and mobile devices
🔗 Simple Workflow: Just paste a URL and click generate!

How to Use

Start the server: python main.py
Open http://localhost:8000 in your browser
Enter an article URL
Click "Generate Video"
Watch real-time progress updates
Preview and download your video!

API Documentation

The FastAPI backend provides comprehensive API documentation:

Interactive API Docs: http://localhost:8000/docs (Swagger UI)
Alternative Docs: http://localhost:8000/redoc (ReDoc)
Detailed Guide: See API.md for complete reference

API Endpoints

Health & Status

GET /health - Health check

Video Generation

POST /api/generate-video - Generate video from URL (creates background job)
WS /ws - WebSocket for real-time progress updates

Video Management

GET /api/videos - List all generated videos with metadata
GET /api/videos/{video_id}/file - Stream/download video file by ID
DELETE /api/videos/{video_id} - Delete video (local and/or cloud storage)

Job Management

GET /api/jobs - List all background jobs
GET /api/jobs/{job_id} - Get specific job status
POST /api/jobs/{job_id}/cancel - Cancel running job

Installation

Prerequisites

Python 3.11 or higher
FFmpeg installed on your system
Virtual environment tool (venv or uv)

Setup

Clone the repository
```
git clone <repository-url>
cd reelcraft
```

Create and activate virtual environment

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies

Using uv (recommended):
```
uv sync
```
Or using pip:
```
pip install -e .
```

Set up environment variables

Create a .env file in the project root:

# Required API Keys
GEMINI_API_KEY=your_gemini_api_key_here
PEXELS_API_KEY=your_pexels_api_key_here
FIRECRAWL_API_KEY=your_firecrawl_api_key_here

# Optional - Cloud Storage (Cloudflare R2)
R2_ENABLED=false
R2_ENDPOINT_URL=https://your-account-id.r2.cloudflarestorage.com
R2_ACCESS_KEY_ID=your_r2_access_key
R2_SECRET_ACCESS_KEY=your_r2_secret_key
R2_BUCKET_NAME=reelcraft-videos
R2_PUBLIC_URL=https://videos.yourdomain.com

# Optional - Monitoring
LANGFUSE_PUBLIC_KEY=your_langfuse_public_key
LANGFUSE_SECRET_KEY=your_langfuse_secret_key
LANGFUSE_HOST=https://cloud.langfuse.com

API Keys

Required:

Gemini API: Get your key at Google AI Studio
Pexels API: Get your key at Pexels API
FireCrawl API: Get your key at FireCrawl

Optional:

Cloudflare R2: Set up at Cloudflare Dashboard → R2 → Create bucket
Langfuse: Get your keys at Langfuse

Usage

Web Interface (Recommended)

The easiest way to use ReelCraft is through the web interface:

Start the server

# Using uv
uv run python main.py

# Or with activated venv
python main.py

Open your browser

Navigate to http://localhost:8000
Generate videos
- Enter an article URL
- Click "Generate Video"
- Watch real-time progress updates
- Preview and download your video

The web interface provides:

Real-time progress tracking via WebSocket
Video preview and download
Gallery of all generated videos
Modern, responsive design

For detailed API documentation, visit http://localhost:8000/docs when the server is running, or see API.md.

Programmatic Usage

You can also use ReelCraft programmatically:

import asyncio
from pipeline import pipeline

# Generate a video from an article URL
asyncio.run(pipeline("https://example.com/article"))

Advanced Usage

from pipeline import pipeline, generate_assets
from utils.video_editing import script_to_asset_details, video_editing_pipeline
import asyncio
import json

async def custom_pipeline():
    # Step 1: Get article and generate script
    article_url = "https://example.com/article"

    # ... (rest of pipeline logic)

    # Step 2: Add custom background music
    asset_details = await script_to_asset_details(
        script,
        background_music_path="path/to/music.mp3"
    )

    # Step 3: Generate video
    await video_editing_pipeline(asset_details)

asyncio.run(custom_pipeline())

API Usage

You can also interact with ReelCraft programmatically via the REST API:

import requests

# Generate a video via API
response = requests.post(
    "http://localhost:8000/api/generate-video",
    json={"url": "https://example.com/article"}
)

result = response.json()
print(f"Video created: {result['video_path']}")

# List all videos
videos = requests.get("http://localhost:8000/api/videos").json()
for video in videos['videos']:
    print(f"{video['filename']} - {video['size_mb']} MB")

Or using cURL:

# Generate video
curl -X POST "http://localhost:8000/api/generate-video" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article"}'

# List videos
curl "http://localhost:8000/api/videos"

For complete API documentation, visit http://localhost:8000/docs when the server is running, or see API.md.

Project Structure

reelcraft/
├── main.py                  # Server entry point (start here!)
├── pipeline.py              # Core video generation pipeline
├── reelcraft.db            # SQLite database (auto-created)
├── frontend/                # Web interface
│   ├── index.html          # Main UI page
│   ├── style.css           # Styling
│   └── app.js              # Frontend logic & WebSocket client
├── services/                # Backend services
│   ├── api.py              # FastAPI application & REST endpoints
│   ├── database.py         # SQLAlchemy models & database setup
│   └── storage.py          # Cloud storage integration (R2)
├── config/
│   ├── directories.py       # Asset folder paths
│   ├── prompts.py          # AI prompt templates
│   ├── logger.py           # Logging configuration
│   └── langfuse_config.py  # Langfuse monitoring setup
├── utils/
│   ├── ai.py               # Gemini AI integration (LLM + TTS)
│   ├── assets.py           # Pexels asset download
│   ├── fire_crawl.py       # Article content extraction
│   ├── video_editing.py    # FFmpeg video composition
│   └── http_client.py      # HTTP utilities
└── assets/
    └── temp/
        ├── audio/          # Generated voice-overs
        ├── videos/         # Downloaded video assets
        ├── images/         # Downloaded image assets
        └── outputs/        # Final generated videos (if stored locally)

Storage & Database

Database

ReelCraft uses SQLite with SQLAlchemy for storing video metadata and job information:

Videos Table: Stores video metadata (title, source URL, file path, storage location, duration, size)
Jobs Table: Tracks background video generation jobs with status and progress
Automatic Setup: Database is created automatically on first run at reelcraft.db

Cloud Storage (Optional)

ReelCraft supports Cloudflare R2 (S3-compatible) for scalable video storage:

Benefits:

Unlimited storage without local disk constraints
Global CDN delivery for fast video access
Automatic upload after video generation
Cost-effective storage (R2 has no egress fees)

Setup:

Install boto3 (if not already installed):
```
uv add boto3
# or: pip install boto3
```
Create R2 bucket at Cloudflare Dashboard:
- Navigate to R2 → Create bucket
- Note your bucket name and account ID

Configure environment variables in .env:

R2_ENABLED=true
R2_ENDPOINT_URL=https://your-account-id.r2.cloudflarestorage.com
R2_ACCESS_KEY_ID=your_access_key
R2_SECRET_ACCESS_KEY=your_secret_key
R2_BUCKET_NAME=reelcraft-videos
R2_PUBLIC_URL=https://videos.yourdomain.com  # Optional: custom domain

Behavior:
- When enabled, videos are automatically uploaded to R2 after generation
- Local copies can be deleted to save disk space
- Videos are served from cloud URL instead of local file

Storage Locations:

local: Video stored only on server disk
cloud: Video stored in Cloudflare R2 (can delete local copy)

Configuration

Directories

Edit config/directories.py to change asset locations:

AUDIO_DIR = "assets/temp/audio"
VIDEO_DIR = "assets/temp/videos"
IMAGE_DIR = "assets/temp/images"
OUTPUT_FOLDER = Path("assets/temp/outputs")

Script Generation

Customize the script generation prompt in config/prompts.py to adjust:

Video tone and style
Scene count (7-15 scenes)
Script length (30-60 seconds)
Asset keyword generation

Audio Settings

Adjust voice settings in utils/ai.py:

# Change voice
voice_name="Kore"  # Available voices: Kore, Aoede, Charon, Fenrir, etc.

# Adjust concurrency (avoid rate limits)
AUDIO_GENERATION_SEMAPHORE = asyncio.Semaphore(3)  # Max 3 concurrent requests

Video Settings

Modify video parameters in utils/video_editing.py:

fps = 25              # Frame rate
asset_width = 720     # Video width (portrait: 720x1280)
asset_height = 1280   # Video height

# Audio mixing volumes
background_volume = 0.2  # Background music volume
voiceover_volume = 2.0   # Voice-over volume

Pipeline Details

Script Structure

Each generated script contains:

{
  "title": "Video Title",
  "scenes": [
    {
      "scene_number": 1,
      "script": "Scene narration text",
      "asset_keywords": ["keyword1", "keyword2", "keyword3"],
      "asset_type": "image/video",
      "audio_file_path": "path/to/audio.wav",
      "duration": 5.2,
      "asset_file_path": "path/to/visual.mp4"
    }
  ]
}

Processing Flow

Content Extraction (utils/fire_crawl.py)
- Fetches article markdown using FireCrawl API
- WebSocket update: "Extracting article content..."
Script Generation (utils/ai.py + config/prompts.py)
- Gemini AI generates 7-15 scenes with narration and asset keywords
- WebSocket update: "Generating script..."
Audio Generation (services/pipeline.py:56-71)
- Parallel TTS generation for all scenes (max 3 concurrent)
- Audio-driven pacing: Measures duration of each generated audio file
- Sets scene duration based on voiceover length (ensures natural speech)
- WebSocket update: Progress updates during generation
Asset Download (services/pipeline.py:74-76)
- Parallel download of images/videos from Pexels
- Handles "image/video" asset types intelligently
Video Composition (utils/video_editing.py)
- Combines all audio files sequentially
- Stitches visual assets to match audio duration (images displayed for full voiceover length, videos trimmed/looped)
- Applies Ken Burns effect (zoom/pan) for still images
- Mixes voice-over with background music
- Exports final video

Monitoring with Langfuse

If Langfuse is configured, the pipeline automatically tracks:

LLM API calls and token usage
Generation parameters and responses
Error tracking and debugging

View traces at your Langfuse dashboard.

Troubleshooting

FFmpeg Not Found

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt-get install ffmpeg

# Windows
# Download from https://ffmpeg.org/download.html

Rate Limit Errors

Adjust the semaphore limit in utils/ai.py:

AUDIO_GENERATION_SEMAPHORE = asyncio.Semaphore(2)  # Reduce concurrent requests

No Assets Found

Check your Pexels API key
Try more generic keywords in the script
Verify internet connection

Audio/Video Sync Issues

The pipeline uses audio-driven duration - each scene's visual duration is set by the voiceover length, ensuring natural-sounding narration. If sync issues occur:

Check that audio files are being generated correctly
Verify scene duration calculations in services/pipeline.py:69-71
Ensure FFmpeg is properly installed and accessible
Review audio file integrity with ffprobe

Dependencies

Core dependencies:

ffmpeg-python - Video editing
pydub - Audio processing
google-genai - Gemini AI (LLM + TTS)
firecrawl-py - Article extraction
httpx - Async HTTP client
requests - Pexels API
fastapi - Web framework and API
uvicorn - ASGI server
sqlalchemy - Database ORM
aiosqlite - Async SQLite driver
boto3 - AWS S3/R2 client (optional, for cloud storage)
langfuse - Monitoring (optional)
python-dotenv - Environment configuration

Limitations

Video Duration: Currently optimized for 30-60 second videos
Rate Limits: Free tier APIs have request limits (use semaphores)
Asset Quality: Dependent on Pexels search results
Gemini TTS Voices: Limited to available prebuilt voices
FFmpeg Dependencies: Requires system FFmpeg installation

Roadmap

Web UI for easier usage (FastAPI + WebSocket)
Subtitle generation support
Custom font and styling options
Multiple aspect ratios (square, landscape)
Direct social media upload integration
Batch processing multiple articles
Custom background music library

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Submit a pull request

License

[Add your license here]

Acknowledgments

Google Gemini AI for LLM and TTS capabilities
Pexels for stock media API
FireCrawl for article extraction
FFmpeg for video processing

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
config		config
frontend		frontend
mocks		mocks
resources		resources
scripts		scripts
services		services
tests		tests
utils		utils
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

ReelCraft

Features

Core Features

✨ New Visual Polish Features

Web Interface Features

How It Works

Web Interface

Features

How to Use

API Documentation

API Endpoints

Installation

Prerequisites

Setup

API Keys

Usage

Web Interface (Recommended)

Programmatic Usage

Advanced Usage

API Usage

Project Structure

Storage & Database

Database

Cloud Storage (Optional)

Configuration

Directories

Script Generation

Audio Settings

Video Settings

Pipeline Details

Script Structure

Processing Flow

Monitoring with Langfuse

Troubleshooting

FFmpeg Not Found

Rate Limit Errors

No Assets Found

Audio/Video Sync Issues

Dependencies

Limitations

Roadmap

Contributing

License

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages