A professional-grade, browser-based voice recording and processing studio with AI-powered creative tools.
Voice Tweaker Studio is a web application for recording, editing, and enhancing voice recordings. It is built entirely with client-side web technologies and combines multi-take recording, real-time audio effects, AI-powered content generation, and video integration, with no backend server of its own. The AI features call the Google Gemini API, so they need an API key and network access.
- Multi-take recording with punch & roll capabilities
- Real-time voice effects (pitch shift, reverb, compression, etc.)
- Voice preset system for quick character switching
- Advanced audio processing (noise reduction, normalization)
- AI Script Writer - Generate custom scripts from prompts
- AI Story Generator - Create detailed stories with context
- AI Character Voice Analyzer - Extract character voice profiles
- AI Voice Preset Generator - Create voice presets from descriptions
- Video import and timeline synchronization
- Subtitle creation and editing with timestamps
- Multi-language translation with 20+ supported languages
- Translation memory system for consistent translations
- Multi-track mixer with volume, mute, solo controls
- Track reordering and length adjustment
- Background music looping for continuous ambiance
- Waveform visualization and editing
- Local storage for recordings and projects
- Export options for audio, video, and subtitles
- Folder organization system
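As an illustration of the normalization feature listed above, here is a minimal sketch of peak normalization over raw samples. It is not taken from the project's source: the function name `normalizePeak` and the `targetPeak` parameter are hypothetical, and the samples are assumed to be mono floats in the range [-1, 1], which is the format the Web Audio API exposes through `AudioBuffer.getChannelData()`.

```typescript
// Hypothetical helper: scale samples so the loudest one reaches `targetPeak`.
// Assumes mono PCM floats in [-1, 1]; returns a new array and leaves the input untouched.
function normalizePeak(samples: Float32Array, targetPeak: number = 0.9): Float32Array {
  let peak = 0;
  for (let i = 0; i < samples.length; i++) {
    const magnitude = Math.abs(samples[i]);
    if (magnitude > peak) peak = magnitude;
  }
  // Silent input has no peak to scale against, so return an unchanged copy.
  if (peak === 0) return samples.slice();
  const gain = targetPeak / peak;
  return samples.map((s) => s * gain);
}
```

Peak normalization only matches the loudest sample to a target; perceived loudness can still differ between takes, which is where compression or loudness-based normalization would come in.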
- Framework: Next.js with React 18 (App Router)
- UI: Tailwind CSS with shadcn/ui components
- Audio Processing: Web Audio API
- Storage: IndexedDB and LocalStorage
- AI Integration: Google Gemini API
- Video Processing: HTML5 Video API
- State Management: React Context API and custom hooks
- Modern web browser (Chrome, Firefox, Edge recommended)
- Google Gemini API key (for AI features)
- Clone the repository:
  ```bash
  git clone https://github.com/yourusername/voice-tweaker-studio.git
  cd voice-tweaker-studio
  ```
- Install dependencies:
  ```bash
  npm install
  ```
- Create a `.env.local` file with your API keys:
  ```
  NEXT_PUBLIC_GEMINI_API_KEY=your_gemini_api_key_here
  ```
- Start the development server:
  ```bash
  npm run dev
  ```
- Open http://localhost:3000 in your browser.
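If the AI panels fail silently, a missing or placeholder key is a likely cause. The sketch below shows one way to fail early with a readable message. `requireApiKey` is a hypothetical helper, not part of the project. Next.js inlines `NEXT_PUBLIC_*` variables into the browser bundle at build time, so the variable should be referenced by its full static name, and the key will be visible to anyone who loads the page.

```typescript
// Hypothetical helper: validate the key read from .env.local before using it.
function requireApiKey(value: string | undefined): string {
  const key = (value ?? "").trim();
  // Reject both a missing value and the placeholder from the setup instructions.
  if (key === "" || key === "your_gemini_api_key_here") {
    throw new Error("NEXT_PUBLIC_GEMINI_API_KEY is not set; add it to .env.local");
  }
  return key;
}

// Usage inside the app (the static reference lets Next.js inline the value):
//   const apiKey = requireApiKey(process.env.NEXT_PUBLIC_GEMINI_API_KEY);
```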
- Click the "Record" button to start recording
- Speak into your microphone
- Click "Stop" when finished
- Preview your recording with the "Play" button
- Save or discard the recording
- Navigate to the AI Script Writer panel
- Enter a prompt (e.g., "A comedy scene between two friends at a coffee shop")
- Click "Generate Script"
- Review and edit the generated script
- Click "Load to Transcript" to prepare for recording
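For readers curious about what a "Generate Script" click might do, here is a rough sketch of a request to the Gemini REST `generateContent` endpoint. It is an assumption about the approach, not the project's actual code, which may use an SDK instead. The helper names `buildScriptRequest` and `extractText`, the instruction wording, and the model name `gemini-1.5-flash` are all illustrative.

```typescript
interface GeminiRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Hypothetical helper: turn a user prompt into a generateContent request.
// The default model name is an assumption; substitute whichever model you use.
function buildScriptRequest(
  prompt: string,
  apiKey: string,
  model: string = "gemini-1.5-flash"
): GeminiRequest {
  const instruction = "Write a short voice-over script for this idea:\n" + prompt;
  return {
    url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json", "x-goog-api-key": apiKey },
      body: JSON.stringify({ contents: [{ parts: [{ text: instruction }] }] }),
    },
  };
}

// Hypothetical helper: join the text parts of the first candidate in a response.
function extractText(response: unknown): string {
  const r = response as {
    candidates?: { content?: { parts?: { text?: string }[] } }[];
  };
  const parts = r.candidates?.[0]?.content?.parts ?? [];
  return parts.map((p) => p.text ?? "").join("");
}

// Usage in the browser:
//   const { url, init } = buildScriptRequest("A comedy scene ...", apiKey);
//   const script = extractText(await (await fetch(url, init)).json());
```

Keeping the request builder separate from the `fetch` call makes the prompt logic easy to check without network access.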
- Record your voiceover or import existing audio
- Add background music tracks
- Adjust volumes, start times, and durations
- Enable looping for background tracks if needed
- Reorder tracks as necessary
- Export the final mix
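The mute, solo, and looping behaviour in the steps above can be sketched as two small pure functions. This is an illustrative model under stated assumptions, not the project's implementation: the `MixerTrack` shape and both function names are hypothetical, and the sketch assumes that mute takes precedence over solo, a convention that differs between mixers.

```typescript
// Hypothetical track model; field names are illustrative.
interface MixerTrack {
  name: string;
  volume: number;    // linear gain, 0..1
  muted: boolean;
  solo: boolean;
  startTime: number; // seconds from the start of the mix
  duration: number;  // seconds of source audio
  loop: boolean;
}

// Gain actually applied to each track. Muted tracks are silent, and if any
// track is soloed, every non-solo track is silent as well.
function effectiveGains(tracks: MixerTrack[]): number[] {
  const anySolo = tracks.some((t) => t.solo);
  return tracks.map((t) => {
    if (t.muted) return 0;
    if (anySolo && !t.solo) return 0;
    return t.volume;
  });
}

// Times at which a track's audio begins playing. A looping track restarts
// back to back until the end of the mix; other tracks play once.
function playbackStarts(track: MixerTrack, mixLength: number): number[] {
  if (track.duration <= 0 || track.startTime >= mixLength) return [];
  if (!track.loop) return [track.startTime];
  const starts: number[] = [];
  for (let t = track.startTime; t < mixLength; t += track.duration) {
    starts.push(t);
  }
  return starts;
}
```

In a Web Audio graph the same looping effect is usually achieved by setting `loop = true` on an `AudioBufferSourceNode`; computing the start times explicitly is mainly useful for drawing the timeline or rendering an offline export.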
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- Made with ❤️ by criptichood
- Built with Next.js and React
- UI components from shadcn/ui
- AI capabilities powered by Google Gemini