gemini-audio-engineer-react

A vibe-coded Mix Assistant AI that provides "expert" audio engineering feedback:

React (Vite) frontend: Upload audio, waveform playback with region selection (WaveSurfer), chat-based consultation
FastAPI backend: Trims selected audio regions, generates Mel spectrograms, and provides AI-powered mixing advice
Multi-model support: Choose between Gemini (with spectrogram analysis) or OpenAI GPT Audio models

Features

🎵 Audio Analysis: Upload WAV, MP3, or FLAC files and select specific regions for analysis
📊 Spectrogram Generation: Visual frequency analysis to identify issues
🤖 AI Consultation: Get professional mixing and mastering advice from AI models
💬 Chat Interface: Follow-up conversations to drill deeper into specific issues
🎛️ Multiple Models: Support for Gemini 3, Gemini 2.0, and OpenAI GPT Audio models

1) Backend Setup

cd backend
python -m venv .venv

# Windows:
.venv\Scripts\activate

# macOS/Linux:
# source .venv/bin/activate

pip install -r requirements.txt

Environment Configuration

Copy the example environment file and configure your API keys:

# Windows:
copy .env.example .env

# macOS/Linux:
cp .env.example .env

Then edit .env with your API keys:

# Required for Gemini models (gemini-3-pro, gemini-3-flash, gemini-2.0, etc.)
GEMINI_API_KEY=your_gemini_api_key_here

# Required for OpenAI GPT Audio model
OPENAI_API_KEY=your_openai_api_key_here

# Optional: Only needed if FFmpeg is not in your system PATH
FFMPEG_PATH=

Note: You only need to configure the API key(s) for the model(s) you plan to use.
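To illustrate that note, here is a minimal sketch of how a backend could work out which model families are usable from the keys that are set. It is not taken from the project's code: `available_providers` and the provider labels are hypothetical, and only the variable names `GEMINI_API_KEY` and `OPENAI_API_KEY` come from the `.env` file above. It also assumes the values in `.env` have already been loaded into the process environment.

```python
import os
from typing import Mapping, Optional

# Variable names come from the .env file; the provider labels are illustrative.
KEY_FOR_PROVIDER = {
    "gemini": "GEMINI_API_KEY",
    "openai": "OPENAI_API_KEY",
}


def available_providers(env: Optional[Mapping[str, str]] = None) -> list[str]:
    """Return the providers whose API key is set to a non-empty value."""
    env = os.environ if env is None else env
    return [
        provider
        for provider, key_name in KEY_FOR_PROVIDER.items()
        if env.get(key_name, "").strip()
    ]
```

Calling `available_providers()` with no arguments inspects the real environment; passing a plain dictionary makes the check easy to exercise without touching it.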

FFmpeg Dependency

FFmpeg is required for audio processing. Install it based on your OS:

Windows (WinGet):

winget install Gyan.FFmpeg

macOS (Homebrew):

brew install ffmpeg

Linux (apt):

sudo apt install ffmpeg

If FFmpeg is installed but the app can't find it, set the FFMPEG_PATH variable in your .env file to the directory that contains ffmpeg.exe (Windows) or the ffmpeg binary (macOS/Linux).
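The lookup order described above (the `FFMPEG_PATH` directory first, then the system PATH) could be sketched as follows. This is an assumption about how such a lookup might work, not the backend's actual implementation; `resolve_ffmpeg` is a hypothetical helper name.

```python
import os
import shutil
import sys
from pathlib import Path
from typing import Mapping, Optional


def resolve_ffmpeg(env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return a path to the ffmpeg executable, or None if it cannot be found."""
    env = os.environ if env is None else env
    exe_name = "ffmpeg.exe" if sys.platform.startswith("win") else "ffmpeg"

    # 1) Prefer the directory named by FFMPEG_PATH, if one is configured.
    configured_dir = env.get("FFMPEG_PATH", "").strip()
    if configured_dir:
        candidate = Path(configured_dir) / exe_name
        if candidate.is_file():
            return str(candidate)

    # 2) Otherwise fall back to whatever is on the system PATH.
    return shutil.which("ffmpeg")
```

A return value of `None` means neither location contains FFmpeg, which is the case where setting `FFMPEG_PATH` is worth trying.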

Run the API

uvicorn app:app --reload --port 8000

Health check: Open http://localhost:8000/health
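If you prefer to check from a script rather than a browser, a small sketch using only the Python standard library is shown below. It assumes only that the `/health` endpoint answers with a 2xx status when the API is up; the shape of the response body is not documented here, so the sketch does not inspect it. `check_health` is a hypothetical helper name.

```python
import urllib.request


def check_health(base_url: str = "http://localhost:8000", timeout: float = 3.0) -> bool:
    """Return True if GET <base_url>/health answers with a 2xx status."""
    url = base_url.rstrip("/") + "/health"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except OSError:
        # Covers connection refused, timeouts, and HTTP error statuses
        # (urllib.error.URLError and HTTPError are both OSError subclasses).
        return False
```

With the backend running, `check_health()` should return `True`; if it returns `False`, confirm that uvicorn started on port 8000.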

2) Frontend Setup

cd frontend
npm install
npm run dev

Open http://localhost:5173

Usage

Upload Audio: Select a WAV, MP3, or FLAC file
Select Region: Click and drag on the waveform to select the section you want analyzed
Choose Model: Select from available Gemini or OpenAI models
Enter Prompt: Describe what you want feedback on (e.g., "Check the overall frequency balance")
Start Analysis: Click the button to get AI-powered mixing advice
Follow Up: Use the chat to ask follow-up questions about specific issues

Model Differences

Model	Spectrogram	Thinking Mode	Best For
Gemini 3 Pro	✅	✅	Deep analysis with visual + audio
Gemini 3 Flash	✅	✅	Fast analysis with visual + audio
Gemini 2.0 Thinking	✅	✅	Complex problem-solving
Gemini 2.0 Flash	✅	❌	Quick responses
GPT Audio	❌	❌	Audio-only analysis
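The table can also be read as a capability lookup that decides which inputs each model is sent. The sketch below transcribes the rows as data; the dictionary keys are the display names from the table, not real API model identifiers, and `ModelCapabilities` and `payload_parts` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelCapabilities:
    spectrogram: bool  # the model is sent the spectrogram image
    thinking: bool     # the model supports a thinking mode


# Rows transcribed from the table above.
MODELS = {
    "Gemini 3 Pro": ModelCapabilities(spectrogram=True, thinking=True),
    "Gemini 3 Flash": ModelCapabilities(spectrogram=True, thinking=True),
    "Gemini 2.0 Thinking": ModelCapabilities(spectrogram=True, thinking=True),
    "Gemini 2.0 Flash": ModelCapabilities(spectrogram=True, thinking=False),
    "GPT Audio": ModelCapabilities(spectrogram=False, thinking=False),
}


def payload_parts(model_name: str) -> list[str]:
    """List the inputs to send: audio always, the spectrogram only if supported."""
    parts = ["audio"]
    if MODELS[model_name].spectrogram:
        parts.append("spectrogram")
    return parts
```

Keeping the capabilities in one table-like structure means a new model only needs a new row, not a new branch in the request code.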

Notes

"Preview" generates only the spectrogram (no AI call) so you can see the visual first
"Start Analysis" trims audio + generates spectrogram + sends to AI for comprehensive feedback
Gemini models receive both the trimmed audio file and the spectrogram image for analysis
The GPT Audio model receives only the audio (it does not accept image input)
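As an illustration of the trimming step, the sketch below builds an FFmpeg command line that cuts a selected region out of an audio file. It assumes the trimming is done by calling the `ffmpeg` CLI, which this README does not confirm (it only states that FFmpeg is required for audio processing). `build_trim_command` and its parameters are hypothetical; `-y`, `-i`, `-ss`, and `-to` are standard FFmpeg options.

```python
def build_trim_command(
    src: str,
    dst: str,
    start_s: float,
    end_s: float,
    ffmpeg: str = "ffmpeg",
) -> list[str]:
    """Build an ffmpeg argument list that keeps only [start_s, end_s] of src."""
    if start_s < 0 or end_s <= start_s:
        raise ValueError("expected 0 <= start_s < end_s")
    return [
        ffmpeg,
        "-y",                      # overwrite dst if it already exists
        "-i", src,                 # input file
        "-ss", f"{start_s:.3f}",   # region start, in seconds
        "-to", f"{end_s:.3f}",     # region end, in seconds
        dst,
    ]
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)`; building it as a list rather than a string avoids shell-quoting problems with file names that contain spaces.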
