An advanced, AI-powered karaoke studio with vocal separation, real-time scoring, and comprehensive audio controls.
- Real-time Scoring: Advanced pitch detection and vocal analysis
- Timestamped Lyrics Display: AI-generated synchronized lyrics with precise timing
- Karaoke-style Highlighting: Dynamic word-by-word lyric progression
- Audio Visualizer: Dynamic waveform visualization during performance
- Performance History: Track your singing progress over time
- Smart Vocal Separation: Replace songs with instrumental versions for perfect karaoke
- Advanced Stem Separation: Isolate drums, bass, vocals, and other instruments into separate tracks
- Timestamped Lyric Transcription: Generate accurate lyrics with precise timing using OpenAI Whisper
- Instant Processing: Optimized AI pipeline with immediate results
- Smart Library Management: Automatic song replacement and organization
- Advanced Mixer: 10-band EQ, gain control, and audio effects
- Auto-Tune: Real-time pitch correction with customizable settings
- Device Selection: Choose your preferred microphone and speakers
- Live Streaming: WebSocket-based audio streaming for remote listening
- Smart Processing: Instant AI results with optimized workflow
- AI Vocal Coach: Personalized feedback powered by Google Gemini
- Key Detection: Automatic song key analysis for better guidance
- Performance Analytics: Detailed scoring breakdown and improvement tips
- Practice Mode: Loop specific sections for targeted practice
- Draggable Windows: Resizable, movable UI panels for custom layouts
- Lion-themed Design: Golden gradient aesthetics with smooth animations
- Dark Mode: Eye-friendly dark interface for studio environments
- Responsive Design: Works on desktop and large tablets
Frontend:
- Node.js 18+
- npm or yarn package manager
AI Backend (Optional but Recommended):
- Python 3.8+
- ffmpeg (for audio processing)
- NVIDIA GPU with CUDA (optional, for faster AI processing)
- 
Clone the repository git clone <repository-url> cd Lion-s-Roar-Studio 
- 
Install dependencies npm install 
- 
Set up environment variables # Copy the example file cp .env.local.example .env.local # Edit .env.local and add your API key GEMINI_API_KEY=your_gemini_api_key_here 
- 
Start the development server npm run dev 
- 
Open your browser - Navigate to http://localhost:3000
- Enjoy the karaoke experience!
 
- Navigate to 
- 
Navigate to the server directory cd server
- 
Create a Python virtual environment # Windows python -m venv venv .\venv\Scripts\activate # macOS/Linux python3 -m venv venv source venv/bin/activate 
- 
Install AI dependencies pip install -r requirements.txt Note: This will install PyTorch, Demucs, Whisper, and other AI models. The download may take several minutes and require ~2-4GB of disk space. 
- 
Start the AI server python main.py 
- 
Verify the setup - AI Server: http://localhost:5000/health
- Should show all dependencies as available
 
- AI Server: 
npm run dev          # Start development server
npm run build        # Build for production
npm run preview      # Preview production build
npm run build-css    # Build Tailwind CSS (watch mode)# In the server directory
python main.py                    # Start AI server
python -m pytest                 # Run tests (if available)
pip install -r requirements.txt  # Install/update dependenciesLion-s-Roar-Studio/
βββ π components/               # React components
β   βββ π ui/                  # Reusable UI components
β   βββ π windows/             # Draggable window components
β   βββ ControlHub.tsx          # Main audio control center
β   βββ LibraryUI.tsx           # Song library management
β   βββ MainStudio.tsx          # Primary karaoke interface
β   βββ SongEditor.tsx          # Lyrics editing with timestamp display
β   βββ LoadingScreen.tsx       # Application loading interface
β   βββ VideoSplashScreen.tsx   # Intro video component
βββ π context/                 # React context providers
β   βββ AppContext.tsx          # Global app state management
βββ π server/                  # Python AI backend
β   βββ main.py                 # Flask server with AI models
β   βββ requirements.txt        # Python dependencies
β   βββ README.md               # Backend documentation
βββ π utils/                   # Utility functions
β   βββ music.ts                # Audio processing and timestamp utilities
βββ π types.ts                 # TypeScript type definitions (TimedLyric, Song, etc.)
βββ π public/                  # Static assets
β   βββ π assets/              # Images and videos
β   βββ favicon.svg             # Custom lion favicon
β   βββ favicon.ico             # Fallback favicon
βββ π¨ index.css               # Tailwind CSS with custom styles
βββ βοΈ App.tsx                 # Main React application
βββ π§ tailwind.config.js      # Tailwind CSS configuration
βββ π¦ package.json            # Project dependencies
- Upload Songs: Click "Upload File(s)" or drag & drop audio files
- Select a Song: Choose from your library or use the demo songs
- Start Singing: Click play and sing along with the lyrics
- View Your Score: Real-time scoring appears as you sing
- Transcribe with Timestamps: Use Advanced Tools β "Transcribe Lyrics" to generate timestamped lyrics
- Song Editor: View both plain lyrics and timed segments with precise timing display
- Karaoke Display: Enjoy word-by-word highlighting synchronized with audio
- Auto-scrolling: Lyrics automatically scroll to keep current line centered
- Multi-language Support: Automatic language detection for international songs
- Intelligent Vocal Removal: Advanced AI creates clean instrumental versions
- Library Integration: Seamlessly replaces songs with processed versions
- Instant Results: Optimized processing pipeline for immediate feedback
- Quality Preservation: Maintains original audio quality and metadata
- Smart Vocal Separation:
- Select a song β Advanced Tools β "Create Instrumental"
- Replaces original song with clean instrumental version
- Perfect for karaoke with confirmation dialog for safety
 
- Advanced Stem Separation:
- Advanced Tools β "Separate All Stems"
- Creates 4 separate tracks: vocals, drums, bass, and other instruments
- Adds new songs to library for individual track practice
 
- Timestamped Lyric Transcription:
- Advanced Tools β "Transcribe Lyrics"
- AI generates accurate lyrics with precise timestamps
- View results in Song Editor with timed segments
- Enjoy synchronized karaoke-style display during playback
- Supports multiple languages with automatic detection
 
- Open Mixer: Click the mixer icon in the control hub
- Adjust EQ: Use the 10-band equalizer for perfect sound
- Apply Effects: Add reverb, echo, or other audio effects
- Auto-Tune: Enable pitch correction for enhanced vocals
# .env.local
GEMINI_API_KEY=your_gemini_api_key_here  # For AI vocal coaching
AI_SERVER_URL=http://localhost:5000      # Local AI server- Vocal Separation: Replaces original song with instrumental version
- Stem Separation: Creates 4 new songs (vocals, drums, bass, other)
- Transcription: Adds timestamped lyrics to existing song
- Confirmation Dialogs: Vocal separation shows warning before replacing original
The app uses custom Tailwind classes for theming:
.lion-button   /* Golden gradient buttons */
.lion-input    /* Dark themed inputs */
.lion-card     /* Consistent card styling */We welcome contributions! Please see our Contributing Guide for details.
- Fork the repository
- Create a feature branch: git checkout -b feature/amazing-feature
- Commit your changes: git commit -m 'Add amazing feature'
- Push to the branch: git push origin feature/amazing-feature
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Demucs: AI-powered source separation by Meta Research
- OpenAI Whisper: Robust speech recognition system
- Google Gemini: Advanced AI for vocal coaching features
- React & Vite: Modern web development framework
- Tailwind CSS: Utility-first CSS framework
Frontend Issues:
- Ensure Node.js 18+ is installed
- Clear browser cache and restart dev server
- Check browser console for error messages
AI Backend Issues:
- Verify Python 3.8+ and ffmpeg are installed
- Check server health at http://localhost:5000/health
- Review server console for dependency errors
Performance Tips:
- Use NVIDIA GPU with CUDA for faster AI processing
- Close unused applications during AI processing
- Ensure sufficient disk space (4GB+ recommended)
- Transcription with timestamps works best with clear audio
- Use high-quality audio files for better lyric accuracy
- Vocal separation works best with stereo recordings
- Confirm processing choices as vocal separation replaces original songs
- π§ Email: support@lionsroar.studio
- π Bug Reports: Create an Issue
- π¬ Discussions: GitHub Discussions
π¦ Made with β€οΈ for karaoke enthusiasts worldwide
Unleash your inner lion and roar with confidence!
