Vocal Analyzer

A tool to analyze vocals from audio files using AI-powered transcription and feature extraction.

Features

Extract vocals from audio files (MP3/WAV)
Extract all available stems (vocals, drums, bass, etc.)
Transcribe vocal content
Analyze vocal range and characteristics
Generate detailed analysis reports with AI insights

Installation with uv

uv is a fast Python package installer and resolver.

Install uv (if not already installed)

curl -LsSf https://astral.sh/uv/install.sh | sh

Install Vocal Analyzer

# Clone or navigate to the project directory
cd /path/to/va

# Install the project with uv
uv pip install -e .

Configuration

Vocal Analyzer can be configured using a TOML configuration file to enable/disable features and customize behavior.

Config File Locations

The tool looks for config files in the following order:

Path specified with --config argument
config.toml in the current directory
~/.config/vocal-analyzer/config.toml

Creating a Config File

Copy the example config and customize it:

cp config.example.toml config.toml
# Edit config.toml to enable/disable features

See config.example.toml for all available options.

Usage

Basic Usage - Extract and Analyze Vocals

va path/to/audio.mp3

This will:

Extract vocals from the audio file
Transcribe the vocals
Analyze vocal features (pitch, range, etc.)
Generate an analysis report

Output files will be created in a new directory: audio-analysis/ next to your input file.

Using a Custom Config File

va path/to/audio.mp3 --config my-config.toml

Extract All Stems

To extract all available stems (vocals, drums, bass, etc.) instead of just vocals:

va path/to/audio.mp3 --all-stems

Specify Output Directory

va path/to/audio.mp3 -o /path/to/output

List Available Models

To see all available separation models and their supported stems:

va --list-models

Use a Specific Model

va path/to/audio.mp3 --model htdemucs_6s.yaml

Default model: htdemucs_6s.yaml (supports 6-stem separation including vocals, drums, bass, guitar, piano, and other)

Quiet Mode

va path/to/audio.mp3 -q

Example

# Analyze vocals from a song
va ~/Downloads/song.mp3

# Extract all stems using the default model
va ~/Downloads/song.mp3 --all-stems

# Use a different model
va ~/Downloads/song.mp3 --all-stems --model model_bs_roformer_ep_317_sdr_12.9755.ckpt

Output

The tool creates an analysis directory containing:

*_vocals.wav - Extracted vocal track (or multiple stem files with --all-stems)
*_analysis.txt - Detailed analysis report including:
- Transcription
- Vocal range analysis
- AI-powered insights on vocal style and technique

Requirements

Python >= 3.11
Dependencies are managed via pyproject.toml
OpenAI API key (set as environment variable OPENAI_API_KEY)

Development

# Install in editable mode with uv
uv pip install -e .

# Run directly
python -m vocal_analyzer.main path/to/audio.mp3

License

See LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.beads		.beads
vocal_analyzer		vocal_analyzer
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
config.example.toml		config.example.toml
pyproject.toml		pyproject.toml
test-config.toml		test-config.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vocal Analyzer

Features

Installation with uv

Install uv (if not already installed)

Install Vocal Analyzer

Configuration

Config File Locations

Creating a Config File

Usage

Basic Usage - Extract and Analyze Vocals

Using a Custom Config File

Extract All Stems

Specify Output Directory

List Available Models

Use a Specific Model

Quiet Mode

Example

Output

Requirements

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vocal Analyzer

Features

Installation with uv

Install uv (if not already installed)

Install Vocal Analyzer

Configuration

Config File Locations

Creating a Config File

Usage

Basic Usage - Extract and Analyze Vocals

Using a Custom Config File

Extract All Stems

Specify Output Directory

List Available Models

Use a Specific Model

Quiet Mode

Example

Output

Requirements

Development

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages