Extract frames from videos and analyze them using AI-powered image recognition.
VisionFrameAnalyzer is a Go-based tool that:
✅ Extracts frames from a video at set intervals using ffmpeg
✅ Uses an AI-powered vision model to analyze and describe each frame
✅ Provides a structured pipeline for video-to-image processing
- 🎞 Frame Extraction – Converts video frames into images
- 🖼 AI-Powered Analysis – Describes each frame using an LLM vision model
- ⚡ Multi-Frame Processing – Handles multiple frames efficiently
- 📝 Detailed Logging – Provides structured logs for debugging
- Go (Golang)
- FFmpeg (Frame Extraction)
- Ollama (LLM-powered image analysis)
- Slog + Tint (Logging)
- Kubernetes Ready (Optional Multi-Cluster Support)
brew install ffmpeg
go mod tidy
##2️⃣ Run the Analysis
go run main.go --video path/to/video.mp4 --output output_frames
go run main.go --video input.mp4 --output frames
📽️ Automated Video Analysis – Extract insights from video feeds 🔍 Content Moderation – Detect and describe images in video content 🛠 Machine Learning Pipelines – Pre-process video datasets for AI models
MIT License. See LICENSE for details.