Speaker Diarization

pyannote-audio is an open-source toolkit written in Python for speaker diarization.

pyannote-onnx is used to convert the pretrained model defined in PyTorch into the ONNX format and then run it with ONNX Runtime (in C++ or Python).

Only Python 3.8+ is supported.

Usage

Download the pretrained model from Hugging Face pyannote/segmentation-3.0.
Export the pretrained model to ONNX model.
Run the ONNX model with ONNX Runtime in C++ or Python.

$ pip install torch onnx https://github.com/pyannote/pyannote-audio/archive/refs/heads/develop.zip
$ python export_onnx.py pytorch_model.bin segmentation-3.0.onnx

$ pip install -r requirements.txt
$ python main.py data/test_16k.wav

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
data		data
pyannote_onnx		pyannote_onnx
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
VERSION		VERSION
export_onnx.py		export_onnx.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

data

data

pyannote_onnx

pyannote_onnx

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

README.md

README.md

VERSION

VERSION

export_onnx.py

export_onnx.py

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

Speaker Diarization

Usage

About

Releases

Packages

Languages

License

pengzhendong/pyannote-onnx

Folders and files

Latest commit

History

Repository files navigation

Speaker Diarization

Usage

About

Resources

License

Stars

Watchers

Forks

Languages