Skip to content
/ vision Public
forked from bdougie/vision

Extract frames from videos and analyze them using AI-powered image recognition.

License

Notifications You must be signed in to change notification settings

jpmcb/vision

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VisionFrameAnalyzer 📹🖼️

Extract frames from videos and analyze them using AI-powered image recognition.

🚀 Overview

VisionFrameAnalyzer is a Go-based tool that:
✅ Extracts frames from a video at set intervals using ffmpeg
✅ Uses an AI-powered vision model to analyze and describe each frame
✅ Provides a structured pipeline for video-to-image processing

✨ Features

  • 🎞 Frame Extraction – Converts video frames into images
  • 🖼 AI-Powered Analysis – Describes each frame using an LLM vision model
  • Multi-Frame Processing – Handles multiple frames efficiently
  • 📝 Detailed Logging – Provides structured logs for debugging

🛠 Tech Stack

  • Go (Golang)
  • FFmpeg (Frame Extraction)
  • Ollama (LLM-powered image analysis)
  • Slog + Tint (Logging)
  • Kubernetes Ready (Optional Multi-Cluster Support)

📦 Installation & Setup

1️⃣ Install Dependencies

MacOS (Homebrew)

brew install ffmpeg
go mod tidy

##2️⃣ Run the Analysis

go run main.go --video path/to/video.mp4 --output output_frames

🛠 Usage Example

go run main.go --video input.mp4 --output frames

📌 Use Cases

📽️ Automated Video Analysis – Extract insights from video feeds 🔍 Content Moderation – Detect and describe images in video content 🛠 Machine Learning Pipelines – Pre-process video datasets for AI models

📜 License

MIT License. See LICENSE for details.

About

Extract frames from videos and analyze them using AI-powered image recognition.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 100.0%