🦁 Frontend

An advanced, AI-powered karaoke studio with vocal separation, real-time scoring, and comprehensive audio controls.

✨ Features

🎵 Core Karaoke Experience

Real-time Scoring: Advanced pitch detection and vocal analysis
Timestamped Lyrics Display: AI-generated synchronized lyrics with precise timing
Karaoke-style Highlighting: Dynamic word-by-word lyric progression
Audio Visualizer: Dynamic waveform visualization during performance
Performance History: Track your singing progress over time

🤖 AI-Powered Audio Processing

Smart Vocal Separation: Replace songs with instrumental versions for perfect karaoke
Advanced Stem Separation: Isolate drums, bass, vocals, and other instruments into separate tracks
Timestamped Lyric Transcription: Generate accurate lyrics with precise timing using OpenAI Whisper
Instant Processing: Optimized AI pipeline with immediate results
Smart Library Management: Automatic song replacement and organization

🎚️ Professional Audio Controls

Advanced Mixer: 10-band EQ, gain control, and audio effects
Auto-Tune: Real-time pitch correction with customizable settings
Device Selection: Choose your preferred microphone and speakers
Live Streaming: WebSocket-based audio streaming for remote listening
Smart Processing: Instant AI results with optimized workflow

🎯 Smart Coaching System

AI Vocal Coach: Personalized feedback powered by Google Gemini
Key Detection: Automatic song key analysis for better guidance
Performance Analytics: Detailed scoring breakdown and improvement tips
Practice Mode: Loop specific sections for targeted practice

🎨 Modern Interface

Draggable Windows: Resizable, movable UI panels for custom layouts
Lion-themed Design: Golden gradient aesthetics with smooth animations
Dark Mode: Eye-friendly dark interface for studio environments
Responsive Design: Works on desktop and large tablets

🚀 Quick Start

Prerequisites

Frontend:

Node.js 18+
npm or yarn package manager

AI Backend (Optional but Recommended):

Python 3.8+
ffmpeg (for audio processing)
NVIDIA GPU with CUDA (optional, for faster AI processing)

🎯 Frontend Setup

Clone the repository

git clone <repository-url>
cd Lion-s-Roar-Studio

Install dependencies
```
npm install
```

Set up environment variables

# Copy the example file
cp .env.local.example .env.local

# Edit .env.local and add your API key
GEMINI_API_KEY=your_gemini_api_key_here

Start the development server
```
npm run dev
```
Open your browser
- Navigate to http://localhost:3000
- Enjoy the karaoke experience!

🤖 AI Backend Setup (For Advanced Features)

Navigate to the server directory
```
cd server
```

Create a Python virtual environment

# Windows
python -m venv venv
.\venv\Scripts\activate

# macOS/Linux
python3 -m venv venv
source venv/bin/activate

Install AI dependencies
```
pip install -r requirements.txt
```
Note: This will install PyTorch, Demucs, Whisper, and other AI models. The download may take several minutes and require ~2-4GB of disk space.
Start the AI server
```
python main.py
```
Verify the setup
- AI Server: http://localhost:5000/health
- Should show all dependencies as available

📋 Available Scripts

Frontend Commands

npm run dev          # Start development server
npm run build        # Build for production
npm run preview      # Preview production build
npm run build-css    # Build Tailwind CSS (watch mode)

Backend Commands

# In the server directory
python main.py                    # Start AI server
python -m pytest                 # Run tests (if available)
pip install -r requirements.txt  # Install/update dependencies

🏗️ Project Structure

Lion-s-Roar-Studio/
├── 📁 components/               # React components
│   ├── 📁 ui/                  # Reusable UI components
│   ├── 📁 windows/             # Draggable window components
│   ├── ControlHub.tsx          # Main audio control center
│   ├── LibraryUI.tsx           # Song library management
│   ├── MainStudio.tsx          # Primary karaoke interface
│   ├── SongEditor.tsx          # Lyrics editing with timestamp display
│   ├── LoadingScreen.tsx       # Application loading interface
│   └── VideoSplashScreen.tsx   # Intro video component
├── 📁 context/                 # React context providers
│   └── AppContext.tsx          # Global app state management
├── 📁 server/                  # Python AI backend
│   ├── main.py                 # Flask server with AI models
│   ├── requirements.txt        # Python dependencies
│   └── README.md               # Backend documentation
├── 📁 utils/                   # Utility functions
│   └── music.ts                # Audio processing and timestamp utilities
├── 📁 types.ts                 # TypeScript type definitions (TimedLyric, Song, etc.)
├── 📁 public/                  # Static assets
│   ├── 📁 assets/              # Images and videos
│   ├── favicon.svg             # Custom lion favicon
│   └── favicon.ico             # Fallback favicon
├── 🎨 index.css               # Tailwind CSS with custom styles
├── ⚛️ App.tsx                 # Main React application
├── 🔧 tailwind.config.js      # Tailwind CSS configuration
└── 📦 package.json            # Project dependencies

🎵 How to Use

Basic Karaoke

Upload Songs: Click "Upload File(s)" or drag & drop audio files
Select a Song: Choose from your library or use the demo songs
Start Singing: Click play and sing along with the lyrics
View Your Score: Real-time scoring appears as you sing

Enhanced Lyrics Experience

Transcribe with Timestamps: Use Advanced Tools → "Transcribe Lyrics" to generate timestamped lyrics
Song Editor: View both plain lyrics and timed segments with precise timing display
Karaoke Display: Enjoy word-by-word highlighting synchronized with audio
Auto-scrolling: Lyrics automatically scroll to keep current line centered
Multi-language Support: Automatic language detection for international songs

Smart Audio Processing

Intelligent Vocal Removal: Advanced AI creates clean instrumental versions
Library Integration: Seamlessly replaces songs with processed versions
Instant Results: Optimized processing pipeline for immediate feedback
Quality Preservation: Maintains original audio quality and metadata

Advanced AI Features

Smart Vocal Separation:
- Select a song → Advanced Tools → "Create Instrumental"
- Replaces original song with clean instrumental version
- Perfect for karaoke with confirmation dialog for safety
Advanced Stem Separation:
- Advanced Tools → "Separate All Stems"
- Creates 4 separate tracks: vocals, drums, bass, and other instruments
- Adds new songs to library for individual track practice
Timestamped Lyric Transcription:
- Advanced Tools → "Transcribe Lyrics"
- AI generates accurate lyrics with precise timestamps
- View results in Song Editor with timed segments
- Enjoy synchronized karaoke-style display during playback
- Supports multiple languages with automatic detection

Audio Mixing

Open Mixer: Click the mixer icon in the control hub
Adjust EQ: Use the 10-band equalizer for perfect sound
Apply Effects: Add reverb, echo, or other audio effects
Auto-Tune: Enable pitch correction for enhanced vocals

🔧 Configuration

Environment Variables

# .env.local
GEMINI_API_KEY=your_gemini_api_key_here  # For AI vocal coaching
AI_SERVER_URL=http://localhost:5000      # Local AI server

Processing Behavior

Vocal Separation: Replaces original song with instrumental version
Stem Separation: Creates 4 new songs (vocals, drums, bass, other)
Transcription: Adds timestamped lyrics to existing song
Confirmation Dialogs: Vocal separation shows warning before replacing original

Tailwind CSS Customization

The app uses custom Tailwind classes for theming:

.lion-button   /* Golden gradient buttons */
.lion-input    /* Dark themed inputs */
.lion-card     /* Consistent card styling */

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Workflow

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Commit your changes: git commit -m 'Add amazing feature'
Push to the branch: git push origin feature/amazing-feature
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Demucs: AI-powered source separation by Meta Research
OpenAI Whisper: Robust speech recognition system
Google Gemini: Advanced AI for vocal coaching features
React & Vite: Modern web development framework
Tailwind CSS: Utility-first CSS framework

🆘 Support

Troubleshooting

Frontend Issues:

Ensure Node.js 18+ is installed
Clear browser cache and restart dev server
Check browser console for error messages

AI Backend Issues:

Verify Python 3.8+ and ffmpeg are installed
Check server health at http://localhost:5000/health
Review server console for dependency errors

Performance Tips:

Use NVIDIA GPU with CUDA for faster AI processing
Close unused applications during AI processing
Ensure sufficient disk space (4GB+ recommended)
Transcription with timestamps works best with clear audio
Use high-quality audio files for better lyric accuracy
Vocal separation works best with stereo recordings
Confirm processing choices as vocal separation replaces original songs

Getting Help

📧 Email: support@lionsroar.studio
🐛 Bug Reports: Create an Issue
💬 Discussions: GitHub Discussions

🦁 Made with ❤️ for karaoke enthusiasts worldwide

Unleash your inner lion and roar with confidence!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
README_EXPANSION.md		README_EXPANSION.md

License

Almir-ctrl/kp-frontend

Folders and files

Latest commit

History

Repository files navigation