ClipFarmer 🎬

Automated Short-Form Video Clipping Pipeline

ClipFarmer is an end-to-end video processing system that converts long-form videos into short, vertical, attention-optimized clips for platforms like Instagram Reels, YouTube Shorts, and TikTok.

It automates clip selection, subject tracking, and vertical reframing using real ML inference and production-style video pipelines.

Key Features

🔊 Engagement-aware clipping via audio intensity analysis
🧠 Face tracking to keep the subject centered frame-by-frame
📱 Automatic 9:16 vertical reframing
🎧 Lossless audio preservation
⚙️ End-to-end automation from upload to final clip

System Overview

ClipFarmer is built as a full-stack, cloud-deployed media pipeline.

Frontend: Upload, preview, and playback
Backend: Video processing, ML inference, FFmpeg orchestration
Infrastructure: AWS EC2 + Vercel

Processing Pipeline

User uploads a long-form video
Audio analysis using FFmpeg
Loudest continuous 30-second segment is identified
Frame-by-frame face detection using MediaPipe
Smooth subject center tracking
Video is cropped to 9:16 vertical format
Original audio is re-muxed
Final short-form clip is produced

Architecture

Frontend

Next.js (App Router)
Deployed on Vercel
Handles uploads, previews, and output playback

Backend

FastAPI (Python)
Hosted on AWS EC2
Responsible for video processing, ML inference, and FFmpeg execution

Processing Stack

FFmpeg — audio analysis, clipping, remuxing
MediaPipe (Face Detection) — subject tracking
OpenCV — frame-level transformations

System Flow

graph TD
    subgraph Client_Layer [Frontend Layer]
        style Client_Layer fill:#f9f9f9,stroke:#333,stroke-width:2px
        UI[User Interface]
        NextAPI[Next.js API Route]
    end

    subgraph Server_Layer [Backend Layer]
        style Server_Layer fill:#eef,stroke:#333,stroke-width:2px
        FastAPI[FastAPI Server]
        
        subgraph Core_Logic [Processing Engine]
            style Core_Logic fill:#dde,stroke:#666,stroke-dasharray: 5 5
            Loudest[Audio Analysis]
            Process[CV & Reframing]
        end
    end

    UI -->|Upload Video| NextAPI
    NextAPI -->|Forward Request| FastAPI
    FastAPI -->|Spawn Subprocess| Loudest
    Loudest -->|Extract Highlight| Process
    Process -->|Vertical Crop| FastAPI
    FastAPI -->|Return Final MP4| NextAPI
    NextAPI -->|Stream Response| UI

Running Locally

Backend (Python)

Requirements

Python 3.10+ (recommended)
FFmpeg
Linux / macOS / WSL

Install Dependencies

pip install fastapi uvicorn mediapipe==0.10.14 opencv-python numpy
sudo apt install ffmpeg libgl1

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
backend		backend
frontend		frontend
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ClipFarmer 🎬

Key Features

System Overview

Processing Pipeline

Architecture

Frontend

Backend

Processing Stack

System Flow

Running Locally

Backend (Python)

Requirements

Install Dependencies

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ClipFarmer 🎬

Key Features

System Overview

Processing Pipeline

Architecture

Frontend

Backend

Processing Stack

System Flow

Running Locally

Backend (Python)

Requirements

Install Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages