pyannote-audio

Neural building blocks for speaker diarization:

speech activity detection
speaker change detection
overlapped speech detection
speaker embedding
speaker diarization pipeline

Installation

# Step 1: create a conda environment with Python 3.6
$ conda env create -f environment.yml
$ source activate pyannote

# Step 2: install from source, using the "develop" branch
$ git clone https://github.com/pyannote/pyannote-audio.git
$ cd pyannote-audio
$ git checkout develop
$ pip install .

If you use Shennong for feature extraction, create the environment from the environment-shennong.yml file instead of environment.yml. If you prefer a regular pip environment, make sure PyYAML is installed in it, then run Step 2.
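For the pip route, a quick check that PyYAML is importable can save a failed install. The snippet below is a minimal sketch and not part of the repository: the helper name `check_pyyaml` is hypothetical, and it assumes `python3` on your PATH is the interpreter of the environment you plan to install into.

```shell
# Hypothetical helper (not shipped with pyannote-audio): reports whether
# PyYAML can be imported by the python3 found on PATH.
check_pyyaml() {
    if python3 -c "import yaml" >/dev/null 2>&1; then
        echo "PyYAML found"
    else
        echo "PyYAML missing: run 'pip install pyyaml' first"
        return 1
    fi
}

# Run the check; print a reminder instead of aborting when it fails.
check_pyyaml || echo "Install PyYAML, then re-run this check before Step 2."
```

Once the check prints "PyYAML found", continue with Step 2 as written above.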

Citation

If you use pyannote.audio, please cite the following papers.

Speech activity and speaker change detection

@inproceedings{Yin2017,
  Author = {Ruiqing Yin and Herv\'e Bredin and Claude Barras},
  Title = {{Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks}},
  Booktitle = {{18th Annual Conference of the International Speech Communication Association, Interspeech 2017}},
  Year = {2017},
  Month = {August},
  Address = {Stockholm, Sweden},
  Url = {https://github.com/yinruiqing/change_detection}
}

Speaker embedding

@inproceedings{Bredin2017,
    author = {Herv\'{e} Bredin},
    title = {{TristouNet: Triplet Loss for Speaker Turn Embedding}},
    booktitle = {42nd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017},
    year = {2017},
    url = {http://arxiv.org/abs/1609.04301},
}

Speaker diarization pipeline

@inproceedings{Yin2018,
  Author = {Ruiqing Yin and Herv\'e Bredin and Claude Barras},
  Title = {{Neural Speech Turn Segmentation and Affinity Propagation for Speaker Diarization}},
  Booktitle = {{19th Annual Conference of the International Speech Communication Association, Interspeech 2018}},
  Year = {2018},
  Month = {September},
  Address = {Hyderabad, India},
}

Tutorials

⚠️ These tutorials assume that you installed the develop branch of pyannote.audio.
⚠️ They are most likely broken in pyannote.audio 1.x.

Documentation

Part of the API is described in this tutorial.
Other than that, there is still a lot to do (contribute?) documentation-wise...

