Skip to content

Latest commit

 

History

History

wespeaker

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

WeSpeaker

Input

Two audio files.

Example
input1: example/00001_spk1.wav
input2: example/00024_spk1.wav

(Wav file from https://huggingface.co/spaces/wenet/wespeaker_demo)

Output

Degree of similarity.

The speakers are 84.8% similar
Welcome, human!

Requirements

This model recommends additional module.

pip3 install torch torchaudio

Usage

Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.

For the sample audio,

$ python3 wespeaker.py --input1 examples/00001_spk1.wav --input2 examples/00024_spk1.wav

Specify two audio files with the --input1 and --input2 options.

By specifying the --english option, it uses the English model and the --chinese option for the Chinese.

$ python3 wespeaker.py --input1 AUDIO_FILE1 --input2 AUDIO_FILE2 --english

Reference

Framework

Onnxruntime

Model Format

ONNX opset=14

Netron

voxceleb_resnet34.onnx.prototxt
cnceleb_resnet34.onnx.prototxt