Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
-
Updated
Aug 17, 2025 - TypeScript
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Open source inference code for Rev's model
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
Transcription from mp3 files to html with or without embedded player
PyAnnote Voice Activity Detection (ONNX version)
A package that can be locally executed to generate minutes in Japanese
speech to text gui for different Whisper models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization
Faster Whisper with Speaker Diarization
you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center
Hobby project to transcribe audio files from meetings to transcripts with a summary
Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio
Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024
Add a description, image, and links to the pyannote topic page so that developers can more easily learn about it.
To associate your repository with the pyannote topic, visit your repo's landing page and select "manage topics."