
 ██╗   ██╗██╗██████╗ ██████╗ ██╗██████╗ ███████╗
 ██║   ██║██║██╔══██╗██╔══██╗██║██╔══██╗██╔════╝
 ██║   ██║██║██║  ██║██████╔╝██║██████╔╝█████╗
 ╚██╗ ██╔╝██║██║  ██║██╔═══╝ ██║██╔═══╝ ██╔══╝
  ╚████╔╝ ██║██████╔╝██║     ██║██║     ███████╗
   ╚═══╝  ╚═╝╚═════╝ ╚═╝     ╚═╝╚═╝     ╚══════╝

Your AI video editor: turn raw recordings into shorts, reels, captions, social posts, and blog posts. Record once, publish everywhere.

An agentic video editor that watches for new recordings and edits them into social-media-ready content (shorts, reels, captions, blog posts, and platform-tailored social posts) using GitHub Copilot SDK AI agents and OpenAI Whisper.


npm install -g vidpipe

✨ Features

VidPipe Features: Input → AI Processing → Outputs


🎙️ Whisper Transcription - Word-level timestamps
📐 Split-Screen Layouts - Portrait, square, and feed
🔇 AI Silence Removal - Context-aware, capped at 20%
💬 Karaoke Captions - Word-by-word highlighting
✂️ Short Clips - Best 15–60s moments, multi-segment
🎞️ Medium Clips - 1–3 min with crossfade transitions
📑 Chapter Detection - JSON, Markdown, YouTube, FFmeta
📱 Social Posts - TikTok, YouTube, Instagram, LinkedIn, X
📰 Blog Post - Dev.to style with web-sourced links
🎨 Brand Voice - Custom tone, hashtags via brand.json
🔍 Face Detection - ONNX-based webcam cropping
🚀 Auto-Publish - Scheduled posting to TikTok, YouTube, Instagram, LinkedIn, X

🚀 Quick Start

# Install globally
npm install -g vidpipe

# Set up your environment
# Unix/Mac
cp .env.example .env
# Windows (PowerShell)
Copy-Item .env.example .env

# Then edit .env and add your OpenAI API key (REQUIRED):
#   OPENAI_API_KEY=sk-your-key-here

# Verify all prerequisites are met
vidpipe --doctor

# Process a single video
vidpipe /path/to/video.mp4

# Watch a folder for new recordings
vidpipe --watch-dir ~/Videos/Recordings

# Full example with options
vidpipe \
  --watch-dir ~/Videos/Recordings \
  --output-dir ~/Content/processed \
  --openai-key sk-... \
  --brand ./brand.json \
  --verbose

Prerequisites:

  • Node.js 20+
  • FFmpeg 6.0+ - Auto-bundled on common platforms (Windows x64, macOS, Linux x64) via ffmpeg-static. On other architectures, install system FFmpeg (see Troubleshooting). Override with the FFMPEG_PATH env var if you need a specific build.
  • OpenAI API key (required) - Get one at platform.openai.com/api-keys. Needed for Whisper transcription and all AI features.
  • GitHub Copilot subscription - Required for AI agent features (shorts generation, social media posts, summaries, blog posts). See GitHub Copilot.

See Getting Started for full setup instructions.


🎮 CLI Usage

vidpipe [options] [video-path]
vidpipe init              # Interactive setup wizard
vidpipe review            # Open post review web app
vidpipe schedule          # View posting schedule
Option Description
--doctor Check that all prerequisites (FFmpeg, API keys, etc.) are installed and configured
[video-path] Process a specific video file (implies --once)
--watch-dir <path> Folder to watch for new recordings
--output-dir <path> Output directory (default: ./recordings)
--openai-key <key> OpenAI API key
--exa-key <key> Exa AI key for web search in social posts
--brand <path> Path to brand.json (default: ./brand.json)
--once Process next video and exit
--no-silence-removal Skip silence removal
--no-shorts Skip short clip extraction
--no-medium-clips Skip medium clip generation
--no-social Skip social media posts
--no-social-publish Skip social media queue-build stage
--late-api-key <key> Override Late API key
--no-captions Skip caption generation/burning
--no-git Skip git commit/push
-v, --verbose Debug-level logging
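
For example, to process a single file once while skipping the clip and social stages, the flags above can be combined like this (one possible combination, not a required one):

vidpipe /path/to/video.mp4 \
  --no-shorts \
  --no-medium-clips \
  --no-social \
  --verbose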

📁 Output Structure

recordings/
└── my-awesome-demo/
    ├── my-awesome-demo.mp4                  # Original video
    ├── my-awesome-demo-edited.mp4           # Silence-removed
    ├── my-awesome-demo-captioned.mp4        # With burned-in captions
    ├── transcript.json                      # Word-level transcript
    ├── transcript-edited.json               # Timestamps adjusted for silence removal
    ├── README.md                            # AI-generated summary with screenshots
    ├── captions/
    │   ├── captions.srt                     # SubRip subtitles
    │   ├── captions.vtt                     # WebVTT subtitles
    │   └── captions.ass                     # Advanced SSA (karaoke-style)
    ├── shorts/
    │   ├── catchy-title.mp4                 # Landscape base clip
    │   ├── catchy-title-captioned.mp4       # Landscape + burned captions
    │   ├── catchy-title-portrait.mp4        # 9:16 split-screen
    │   ├── catchy-title-portrait-captioned.mp4  # Portrait + captions + hook overlay
    │   ├── catchy-title-feed.mp4            # 4:5 split-screen
    │   ├── catchy-title-square.mp4          # 1:1 split-screen
    │   ├── catchy-title.md                  # Clip metadata
    │   └── catchy-title/
    │       └── posts/                       # Per-short social posts (5 platforms)
    ├── medium-clips/
    │   ├── deep-dive-topic.mp4              # Landscape base clip
    │   ├── deep-dive-topic-captioned.mp4    # With burned captions
    │   ├── deep-dive-topic.md               # Clip metadata
    │   └── deep-dive-topic/
    │       └── posts/                       # Per-clip social posts (5 platforms)
    ├── chapters/
    │   ├── chapters.json                    # Structured chapter data
    │   ├── chapters.md                      # Markdown table
    │   ├── chapters.ffmetadata              # FFmpeg metadata format
    │   └── chapters-youtube.txt             # YouTube description timestamps
    └── social-posts/
        ├── tiktok.md                        # Full-video social posts
        ├── youtube.md
        ├── instagram.md
        ├── linkedin.md
        ├── x.md
        └── devto.md                         # Dev.to blog post
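
Because chapters.ffmetadata uses FFmpeg's FFMETADATA format, the chapters can be copied back into a finished video with FFmpeg alone. A minimal sketch, assuming it is run from inside the my-awesome-demo/ folder (the output filename is illustrative):

# Mux the detected chapters into the captioned video without re-encoding
ffmpeg -i my-awesome-demo-captioned.mp4 \
  -i chapters/chapters.ffmetadata \
  -map_metadata 1 -map_chapters 1 \
  -codec copy my-awesome-demo-chaptered.mp4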

📺 Review App

VidPipe includes a built-in web app for reviewing, editing, and scheduling social media posts before publishing.

VidPipe Review UI
Review and approve posts across YouTube, TikTok, Instagram, LinkedIn, and X/Twitter
# Launch the review app
vidpipe review
  • Platform tabs - Filter posts by platform (YouTube, TikTok, Instagram, LinkedIn, X)
  • Video preview - See the video thumbnail and content before approving
  • Keyboard shortcuts - Arrow keys to navigate, Enter to approve, Backspace to reject
  • Smart scheduling - Posts are queued with optimal timing per platform
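
Once posts are approved, the resulting queue can be checked from the same CLI:

# View the scheduled posting slots
vidpipe schedule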

🔄 Pipeline

graph LR
    A[📥 Ingest] --> B[🎙️ Transcribe]
    B --> C[🔇 Silence Removal]
    C --> D[💬 Captions]
    D --> E[🔥 Caption Burn]
    E --> F[✂️ Shorts]
    F --> G[🎞️ Medium Clips]
    G --> H[📑 Chapters]
    H --> I[📝 Summary]
    I --> J[📱 Social Media]
    J --> K[📱 Short Posts]
    K --> L[📱 Medium Posts]
    L --> M[📰 Blog]
    M --> N[📦 Queue Build]
    N --> O[🔄 Git Push]

    style A fill:#2d5a27,stroke:#4ade80
    style B fill:#1e3a5f,stroke:#60a5fa
    style E fill:#5a2d27,stroke:#f87171
    style F fill:#5a4d27,stroke:#fbbf24
    style O fill:#2d5a27,stroke:#4ade80
# Stage Description
1 Ingestion Copies video, extracts metadata with FFprobe
2 Transcription Extracts audio → OpenAI Whisper for word-level transcription
3 Silence Removal AI detects dead-air segments; context-aware removals capped at 20%
4 Captions Generates .srt, .vtt, and .ass subtitle files with karaoke word highlighting
5 Caption Burn Burns ASS captions into video (single-pass encode when silence was also removed)
6 Shorts AI identifies best 15–60s moments; extracts single and composite clips with 6 variants per short
7 Medium Clips AI identifies 1–3 min standalone segments with crossfade transitions
8 Chapters AI detects topic boundaries; outputs JSON, Markdown, FFmetadata, and YouTube timestamps
9 Summary AI writes a Markdown README with captured screenshots
10 Social Media Platform-tailored posts for TikTok, YouTube, Instagram, LinkedIn, and X
11 Short Posts Per-short social media posts for all 5 platforms
12 Medium Clip Posts Per-medium-clip social media posts for all 5 platforms
13 Blog Dev.to blog post with frontmatter, web-sourced links via Exa
14 Queue Build Builds publish queue from social posts with scheduled slots
15 Git Push Auto-commits and pushes to origin main

Each stage can be independently skipped with --no-* flags. A stage failure does not abort the pipeline; subsequent stages proceed with whatever data is available.


🤖 LLM Providers

VidPipe supports multiple LLM providers:

Provider Env Var Default Model Notes
copilot (default) (none) Claude Opus 4.6 Uses GitHub Copilot auth
openai OPENAI_API_KEY gpt-4o Direct OpenAI API
claude ANTHROPIC_API_KEY claude-opus-4.6 Direct Anthropic API

Set LLM_PROVIDER in your .env or pass via CLI. Override model with LLM_MODEL.
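
For example, a minimal .env fragment that switches to the direct OpenAI provider might look like this (the model value simply restates the default from the table above):

# .env
LLM_PROVIDER=openai
LLM_MODEL=gpt-4o          # optional override
OPENAI_API_KEY=sk-your-key-here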

The pipeline tracks token usage and estimated cost across all providers, displaying a summary at the end of each run.


⚙️ Configuration

Configuration is loaded from CLI flags → environment variables → .env file → defaults.

# .env
OPENAI_API_KEY=sk-your-key-here
WATCH_FOLDER=/path/to/recordings
OUTPUT_DIR=/path/to/output
# EXA_API_KEY=your-exa-key       # Optional: enables web search in social/blog posts
# BRAND_PATH=./brand.json         # Optional: path to brand voice config
# FFMPEG_PATH=/usr/local/bin/ffmpeg
# FFPROBE_PATH=/usr/local/bin/ffprobe
# LATE_API_KEY=sk_your_key_here   # Optional: Late API for social publishing
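
Because CLI flags sit at the top of that chain, a value passed on the command line wins over the same setting in .env. A quick sketch (paths are illustrative):

# OUTPUT_DIR in .env is ignored for this run; the flag takes precedence
vidpipe ~/Videos/demo.mp4 --output-dir ~/Content/one-off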

Social media publishing is configured via schedule.json and the Late API. See Social Publishing Guide for details.


📚 Documentation

Guide Description
Getting Started Prerequisites, installation, and first run
Configuration All CLI flags, env vars, skip options, and examples
FFmpeg Setup Platform-specific install (Windows, macOS, Linux, ARM64)
Brand Customization Customize AI voice, vocabulary, hashtags, and content style
Social Publishing Review, schedule, and publish social posts via Late API

🏗️ Architecture

Agentic architecture built on the GitHub Copilot SDK, where each editing task is handled by a specialized AI agent:

graph TD
    BP[🧠 BaseAgent] --> SRA[SilenceRemovalAgent]
    BP --> SA[SummaryAgent]
    BP --> SHA[ShortsAgent]
    BP --> MVA[MediumVideoAgent]
    BP --> CA[ChapterAgent]
    BP --> SMA[SocialMediaAgent]
    BP --> BA[BlogAgent]

    SRA -->|tools| T1[detect_silence, decide_removals]
    SHA -->|tools| T2[plan_shorts]
    MVA -->|tools| T3[plan_medium_clips]
    CA -->|tools| T4[generate_chapters]
    SA -->|tools| T5[capture_frame, write_summary]
    SMA -->|tools| T6[search_links, create_posts]
    BA -->|tools| T7[search_web, write_blog]

    style BP fill:#1e3a5f,stroke:#60a5fa,color:#fff

Each agent communicates with the LLM through structured tool calls, ensuring reliable, parseable outputs.


🛠️ Tech Stack

Technology Purpose
TypeScript Language (ES2022, ESM)
GitHub Copilot SDK AI agent framework
OpenAI Whisper Speech-to-text
FFmpeg Video/audio processing
Sharp Image analysis (webcam detection)
Commander.js CLI framework
Chokidar File system watching
Winston Logging
Exa AI Web search for social posts and blog

🗺️ Roadmap

  • Automated social posting - Publish directly to platforms via Late API
  • Multi-language support - Transcription and summaries in multiple languages
  • Custom templates - User-defined Markdown & social post templates
  • Web dashboard - Browser UI for reviewing and editing outputs
  • Batch processing - Process an entire folder of existing videos
  • Custom short criteria - Configure what makes a "good" short for your content
  • Thumbnail generation - Auto-generate branded thumbnails for shorts

🔧 Troubleshooting

"No binary found for architecture" during install

ffmpeg-static (an optional dependency) bundles FFmpeg for common platforms. On unsupported architectures, it skips gracefully and vidpipe falls back to your system FFmpeg.

Fix: Install FFmpeg on your system:

  • Windows: winget install Gyan.FFmpeg
  • macOS: brew install ffmpeg
  • Linux: sudo apt install ffmpeg (Debian/Ubuntu) or sudo dnf install ffmpeg (Fedora)

You can also point to a custom binary: export FFMPEG_PATH=/path/to/ffmpeg
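
On Windows (PowerShell), the equivalent is to set the variable for the current session (the path shown is illustrative):

# Windows (PowerShell)
$env:FFMPEG_PATH = "C:\ffmpeg\bin\ffmpeg.exe"
$env:FFPROBE_PATH = "C:\ffmpeg\bin\ffprobe.exe"   # optional, matching probe binary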

Run vidpipe --doctor to verify your setup.


📄 License

ISC © htekdev
