Skip to content

Latest commit

 

History

History
34 lines (27 loc) · 1013 Bytes

index.rst

File metadata and controls

34 lines (27 loc) · 1013 Bytes

Hyperion: Speaker Recognition Toolkit

Hyperion is a Speaker Recognition Toolkit based on PyTorch and numpy. It provides:
  • x-Vector architectures: ResNet, Res2Net, Spine2Net, ECAPA-TDNN, EfficientNet, Transformers and others.
  • Embedding preprocessing tools: PCA, LDA, NAP, Centering/Whitening, Length Normalization, CORAL
  • Several flavours of PLDA back-ends: Full-rank PLDA, Simplified PLDA, PLDA
  • Calibration and Fusion tools
  • Recipes for popular datasets: VoxCeleb, NIST-SRE, VOiCES
.. toctree::
   :maxdepth: 4
   :caption: Contents:

   getting-started.rst
   numpy.rst
   torch.rst
   io.rst
   utils.rst


Indices and tables