# concert-scribe

Classify audio in concert recordings into segments of silence, talking, music, and applause. Takes video or audio files as input, extracts the audio, runs it through Google's YAMNet model, and produces a simple text file describing the timeline:
```
0.0-3.36: talking
3.36-33.12: silence
33.12-37.44: applause
37.44-50.4: silence
50.4-108.96: music (Cello)
108.96-118.56: silence
118.56-274.56: music (Cello, Piano)
274.56-285.6: silence
285.6-365.76: music (Cello)
365.76-377.28: applause
377.28-381.6: silence
```
With `--verbose`, instrument durations are included:

```
118.56-274.56: music (Cello: 82.6s, Piano: 15.4s)
```
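The timeline format above is plain text, so it is easy to consume programmatically. A minimal parsing sketch — the `parse_timeline` helper and its regex are illustrative, not part of the package:

```python
import re

# start-end: category, with an optional parenthesised detail (instruments/durations)
LINE_RE = re.compile(r"^([\d.]+)-([\d.]+): (\w+)(?: \((.*)\))?$")

def parse_timeline(text):
    """Parse concert-scribe timeline lines into (start, end, category, detail) tuples."""
    out = []
    for line in text.strip().splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            start, end, cat, detail = m.groups()
            out.append((float(start), float(end), cat, detail))
    return out
```

`detail` is `None` for silence/talking/applause segments and the parenthesised instrument list for music segments.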
## Installation

```shell
pip install concert-scribe
```

Or with pipx:

```shell
pipx install concert-scribe
```

Requires ffmpeg on the system for audio extraction.
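The extraction the tool performs (mono, 16 kHz, video stream dropped) corresponds roughly to an ffmpeg invocation like the following — the file names here are illustrative, and concert-scribe runs this step for you:

```shell
# Drop the video stream (-vn), downmix to mono (-ac 1), resample to 16 kHz (-ar 16000)
ffmpeg -i recording.mp4 -vn -ac 1 -ar 16000 audio.wav
```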
## Usage

```shell
# Single file
concert-scribe recording.mp4

# All videos in a directory
concert-scribe /path/to/videos/

# Custom output directory
concert-scribe recording.mp4 -o /path/to/output/

# Include per-instrument durations
concert-scribe recording.mp4 --verbose
```

## How it works

- Extracts audio from video via ffmpeg (mono, 16kHz)
- Classifies each 0.48s frame using YAMNet (521 AudioSet classes mapped to 4 categories)
- Merges adjacent same-category frames into segments
- Filters out short spurious segments (< 1.5s for music/talking, < 2s for silence)
- Deduplicates music sub-types using the AudioSet hierarchy (keeps only the most specific instrument)
- Writes a `.txt` file per input clip
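The merge-and-filter steps above can be sketched in a few lines. This is not the package's actual implementation — the function names are illustrative, the thresholds are taken from the list above, and a real pipeline would likely reassign short spurious segments to a neighbouring category rather than simply dropping them:

```python
from itertools import groupby

FRAME_SEC = 0.48  # YAMNet frame length, per the pipeline description above
MIN_LEN = {"music": 1.5, "talking": 1.5, "silence": 2.0}  # spurious-segment thresholds

def merge_frames(labels, frame_sec=FRAME_SEC):
    """Collapse a per-frame category list into (start, end, category) segments."""
    segments, t = [], 0.0
    for cat, run in groupby(labels):
        dur = sum(1 for _ in run) * frame_sec
        segments.append((round(t, 2), round(t + dur, 2), cat))
        t += dur
    return segments

def drop_spurious(segments, min_len=None):
    """Discard segments shorter than their category's minimum duration."""
    min_len = MIN_LEN if min_len is None else min_len
    return [s for s in segments if s[1] - s[0] >= min_len.get(s[2], 0.0)]
```

For example, seven "talking" frames followed by three "silence" frames merge into a 3.36s talking segment and a 1.44s silence segment; the silence segment then falls below the 2s threshold and is filtered out.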
## License

Apache-2.0
