Skip to content

AlanTMa/VideoToMusic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

README.md

🎬🎵 SilentVideoSynth

Advanced AI system for generating emotionally-aligned background music for silent videos using multimodal analysis and hybrid neural networks.

License: MIT Python 3.9+ Docker GPU

🌟 Features

  • 🧠 Multimodal AI Analysis: Combines CLIP-based video understanding with text descriptions
  • 🎯 Emotion-Driven Generation: Uses Russell's Circumplex Model for precise emotional mapping
  • 🎼 Hybrid Architecture: Transformer + LSTM for both musical structure and expressiveness
  • 🎵 Multi-Instrument Support: Generate complete orchestral arrangements
  • ⚡ Real-Time Processing: Stream generation for immediate feedback
  • 📊 Comprehensive Evaluation: Quality metrics and emotion alignment scoring
  • 🌐 Modern Web Interface: React/Next.js frontend with real-time updates

🏗️ Architecture

Model Components

  1. Multimodal Transformer Encoder

    • Processes video semantic features (CLIP)
    • Analyzes emotion patterns
    • Tracks motion and scene changes
    • Integrates text descriptions
  2. Chord Progression Transformer

    • Generates harmonic structure
    • Maintains musical coherence
    • Supports various musical styles
  3. Expressive LSTM Decoder

    • Adds note-level details
    • Controls dynamics and articulation
    • Manages rhythmic patterns

Technology Stack

Backend:

  • FastAPI for high-performance API
  • PyTorch for deep learning
  • Transformers (CLIP) for multimodal understanding
  • Pretty MIDI for music generation
  • Redis for caching and session management

Frontend:

  • Next.js with TypeScript
  • Tailwind CSS for styling
  • D3.js for visualizations
  • WebSocket for real-time updates

Infrastructure:

  • Docker containerization
  • Nginx reverse proxy
  • PostgreSQL database
  • Prometheus monitoring

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors