Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.
-
Updated
Jun 29, 2025 - TypeScript
Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Open source inference code for Rev's model
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
PyAnnote Voice Activity Detection (ONNX version)
Transcription from mp3 files to html with or without embedded player
A package that can be locally executed to generate minutes in Japanese
Faster Whisper with Speaker Diarization
you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center
Hobby project to transcribe audio files from meetings to transcripts with a summary
Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024
Add a description, image, and links to the pyannote topic page so that developers can more easily learn about it.
To associate your repository with the pyannote topic, visit your repo's landing page and select "manage topics."